This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



PROCESS FOR EXCHANGING INFORMATION 
IN A MULTIPROCESSOR SYSTEM 
The invention relates to a multiprocessor system of the type 
comprising a central memory, treatment processors and cache 
memories associated with treatment processors • It also relates 
to a process for the exchange of information between central 
memory and treatment processors via the cache memory associated 
with each of these processors. It also provides a new integrated 
circuit component , capable of equipping the multiprocessor 
system. 

It is known that, in the' most common known multiprocessor 
systems, all the information (data, address) is relayed by a 
common parallel communication bus between the central memory and 
the various treatment processors, which constitutes a bottleneck: 
its transfer rate is in effect insufficient to .feed all the 
processors for full efficiency, from a common central memory. 

For increasing the information transfer rate, a first 
solution consists in associating with each treatment processor 
a cache memory which, by the location of the information, permits 
reducing the demands on the central memory. However, in the case 
in which the volume of data shared between processors is 
substantial, the maintenance of coherence of the data between 
memories generates complementary information traffic on the 
communication bus which resists' a significant reduction of the 
overall flow on this bus, and therefor removes a large part of 
the interest in this solution. 

Another solution consists in providing the communication bus 
in the form of a grid network designed as a "crossbar", which 
permits a direct communication between each treatment- processor 
and each subassembly of the central memory (memory bank) . 
However, this solution is very heavy and very costly to achieve 
because of the very great number of interconnections, and it 
becomes completely unrealistic beyond about ten treatment 
processors. Moreover, in the case of multiple" demands of several 
processors on the same memory bank, such a solution implies 
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access conflicts, a source of slowing up the exchanges. 

Another more current solution by reason it its architectural 
simplicity consists in associating a local memory with each 
treatment processor for storing specif ic data therein, and 
storing the transferred data in the common . central memory. 
However, the great deficiency of this architecture is its non- 
transparency, that is, the need for the programmer to organize 
the detail of the allocation of data in the various memories, 
such that this solution is of a very constrained usefulness. 
Moreover, in the case of high volume of transferred data, it may 
lead as before, to a saturation of the access bus in the central 
memory. 

A solution which has been called "aquarius architecture" has 
been proposed by the University of Berkeley and consists in 
improving the aforementioned crossbar solution by combining with 
the crossbar network, for the shared data, cache memories which 
are connected to the crossbar network, and for the shared data, 
distinct cache memories which are connected to a common 
synchronization bus. This solution contributes a gain in speed 
of exchange but remains very heavy and very costly to achieve. 

The present invention seeks to provide a new solution, 
permitting considerably increasing the flow rate of information 
exchange, while retaining an architecture which is transparent 
for the user, much simpler than the crossbar architecture. 

An object of the invention is thus to permit notably 
increasing the number of treatment processors of the system, 
while benefitting from a high efficiency for each processor. 

Another object is to provide a structure of an integrated 
circuit component, permitting a very simple realization of the 
architecture of Itfiis new multiprocessor system. 

To this end, the multiprocessor system provided by the 
invention is of the type comprising a central memory (RAM) 
organized in blocks of information (bi) , treatment processors 
(CPU X . . .CPUj. . .CPU n ) , a cache m mory (MCj) connected to each 
treatment processor (CPUj. . .CPUj. . .CPU n ) a cache memory (MCj) 
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connected to each treatment processor (CPU,, and organized in 
blocks of information ( b| ) of the same size as those of the 
central memory, a directory ( RGj , and its management processor 
(PG j} associated with each cache memory (M Gj , , means for 
communication of addresses of blocks between processors (CPU.) 
and a central memory (ram, . According to the present invention 
the multiprocessor system is provided with: 

an assembly of shift registers, termed memory shift 
registers (RDM1. . .RDM,. . .RDMn, , each register (RDM., of this 
assembly being connected to the central memory (RAM) 'in such a 
manner as to permit, in one cycle of this memory, a parallel 
transfer in read or write of a block of information (b s , between 
said register and said central memory; 

shift registers, termed processor shift, registers 
(RDPl...RDP j ...R D Pn,, each processor shift register (RDP,, being 
connected to the cache memory (M Cj ) of a processor (CPU.) in such 
a manner as to permit a parallel transfer in reading or writing 
of a block of information (b.) between said shift register (RDP., 
and said cache memory (MCj) ; • 

an assembly of series links (LS,. . .LS.. . .ls„) , each 
connecting a memory shift register (RDM,, and a processor shift 
register (RDP,, and adapted to permit the transfer of blocks of 
information (b,, between the two registers considered (RDM, 
RDP,,. * 

Thus, in the multiprocessor system according to the 
invention, the exchanges between the cache memories and the 
associated processors are carried out as in the conventional 
systems provided with cache memories. By contrast, the exchanges 
between the central memory and the cache memories is carried out 
in an entirely original manner. 

Each transfer of an information block (b,, from the central 
memory (RAM, to the cache memory (M Cj , of a given processor 
(CPU,, consists of: 

transferring, in a cycle of the central memory, the block 
(b s , of said central memory (RAM, to the memory shift register " 
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(RDMj) (of the size of one block) which is directly connected to. 
the central memory and which corresponds to the processor (CPUj) 
considered, 1 

transferring on the corresponding series link (LSj) the 
contents of this memory shift register (RDMj) to the processor 
shift register (RDPjj (of the same capacity) which is associated 
with the cache memory (MC,) of the processor considered (CPUj) , 

transferring the contents of said processor shift register 
(RDPj) to the cashe memory (MCj) . 

In the opposite direction, each transfer of information 
blocks (bj) from the cache, memory (MC,) of . a given processor 
(CPUj) to the central memory (RAM) consists of: 

transferring the block (b,) of said cache memory considered 
(MCj) to the processor shift register (RDP,) which is associated 
with said cache memory (MCj) , 

transferring on the corresponding series link (LSj) the 
contents of the processor shift register (RDPp to the memory- 
shift register (RDMj) , allocated to the processor considered 
(among the assembly of shift registers (RDMl. . .RDM,. . .RDMn) 
connected to the central memory (RAM) , 

transferring in a cycle of the central memory, the contents 
of the memory shift register (RDMj) to said central memory (RAM) . 

In these conditions, the transfer of each block of 
information (b,) is carried out, no longer through a parallel 
bus as is the case in the known systems, but by the series! links 
of high flow rate. These series links permit obtaining 
comparable times of transfer for each block (bj) and even lower 
than the transfer times in known parallel bus systems. The 
comparative example given hereinbelow with the current values 
of the parameter for cur-rent technology, illustrates clearly this 
fact which seems paradoxical. 

It is assumed that each block of information (b t ) is of a 
size equal to 64 octets. 

In the system of the inv ntion, the transfer time between 
the central memory and a cache memory breaks down into: 
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a central memory transfer time (RAM) /memory shift register 
(RDM,) : 100 nanoseconds (performance of a central random access 
memory of known type) , 

a series transfer time on the corresponding series link: 
64 x 8 x 1/5 0 0. 10 6 f either 1024 nanoseconds, assuming a transfer 
frequency of 500 megahertz (not exceptional with current 
technology which permits attaining frequencies of 3000 
megahertz) , 

a processor shift register transfer time (RDP,) /cache memory 
(Me,) : 50 nanoseconds (cache memory of the very current type) . 

The total time of transfer of a block is. therefor on the 
order of 1200 nanoseconds (while integrating the chaining delays 
of the second order) . 

In known systems with cache memories in which the exchanges 
of information is carried out directly in parallel by words of 
4 . octets (the most current systems leading to busses of the 
conventional type of 32 data lines), the transfer time for one 
block is equal to the transfer time of 16 words of 4 octets which 
comprise this block, that is: 16 x 100 = 1600 nanoseconds. 

Thus, it is seen that, with the average hypotheses in the 
two solutions, these times are comparable. But, if one compares 
the architecture of the system according to the invention with 
that of a parallel bus common with cache memories (first solution 
mentioned previously) , it will be realized that: 

in the conventional solution (common parallel bus), the 
central memory and the; common bus are occupied at 100% during 
the transfer, since the information circulates between the two 
for the entire transfer time, 

in the system according to the invention, the series link 
is occupied 100% during the transfer, but the central memory is 
occupied less that 10% of the transfer time (time of memory 
reading and loading of the memory shift register (RDMj) ) , such 
that the central memory may serve 10 times more processors than 
in the preceding case (the use of the' series link being without 
significance since it is private and directed to the processor) . ' 
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It is important to emphasize that in this system of the 
invents, each series connection which connects each processor 
in . an individual manner to the central memory is a simple 
connection (of one or two data leads, , such that the series 
network, thus constituted is not comparable in the overall pi an 
to the complexity with, for example, a crossbar network of which 
each connection is a parallel connection with multiplicity of 

all of 11 laadS ° f ^ CO *> arati ^ sample above, , with 

all of the necessary switches. 

Further, as will be seen below on the comparative curves, 
the system according to the invention has greatly improved 
performance w lth respect to the traditional common bus systems 
and permrts in practice operating a much higher number of 
processors (of several tens to a hundred processors, . This 
Performance is compatible with that of a crossbar system, but 
the system according to the invention i= „. 
architectural simplicity. * U< * 

in the system of the invention, eaoh series link may in 
practrce be achieved either by means of two unidirectional series 
Irnks for bit by bit transfer, or by means of a single 
bidirectional series link. 

In the first case, each memory shift register; (RDM.) and 
each processor shift register (RDP,) are divided into two 
registers, one specially for the transfer in one direction, 
the other for the transfer in the other direction. The two 
unidirectional series links are then connected to the divided 
memory shift register (RDM,) and to the corresponding divided 
processor shift register (RDPj) , in such a manner as to permit, 
for one, a transfer in one direction, and for the other, a 
transfer in the other direction. 

This embodiment with two unidirectional "links presents the 
advantage of not requiring any transfer management on the link, 
but the inconvenience of doubling the necessary resources (link, 
registers). " v ' 

in the second case, a validation logic of the transfer ' 



direction is associated with the bidirectional link such as to 
permit an alternate transfer in the two directions on said link. 
This logic may be integrated in the management processor (pg.) 
associated with the cache memory (MCj) to which said 
bidirectional link is connected. 

• It will be understood that each series link may ultimately 
be provided with a higher number of series links. 

In the multiprocessor system according to the invention, 
the address communication means may cover essentially two forms 
embodiments: in the first case, it may consist of a parallel 
address communication bus for blocks (BUSA) , common to all of 
theprocessors (CPUj) and connecting the latter and the central 
memory (RAM) in a conventional manner with an arbitrator bus (AB) 
adapted to manage access conflicts on said bus. It is necessary 
to note that this address bus is only utilized for communication 
of addresses of blocks: in the plan of the structure, this bus 
is identical to the parallel address communication bus of known 
systems, for which no problems of saturation are interposed, 
since it will be freed right after transfer of the address block. 

However, another embodiment of this address communication 
means may be considered in the multiprocessor system of the 
invention, consisting in operating the series links for transfer 
of blocks of information (b,) to transfer the addresses of these 
blocks. 

In this case, a complementary shift register (RDCj) is 
connected to , each series link (LSj) in parallel with the 
corresponding memory shift register (RDCj) . * The addresses 
transmitted by said series link are thus loaded into each of 
these complementary registers (RDCj) . An access management 
arbitrator connected to said registers (RDCj) and. to the central 
memory (RAM) is thus provided for selecting the addresses 
contained in said registers and for managing the conflicts of 
access to the central memory (RAM) . Such an arbitrator is 
conceivably known it itself, this type of access conflicts being 
now resolved for a number of years. In this embodiment, the 
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presence of a parallel communication address bus is avoided, but 
the management resources are made more heavy. 

Further, the multiprocessor system according to the 
invention is particularly well suited for managing in an 
efficient manner the problems of coherence of the data shared 
between treatment processors* In effect, the conventional 
solutions for managing these shared data find their limits in 
the known systems from the fact of the bottleneck at the level 
of the communication of information, but become, on the contrary, 
perfectly satisfactory and efficient in the system of the 
invention where such a bottleneck no longer exists, such that 
this system may be equipped with shared data management means of 
an analogous concept to that of known systems. 

For example, one traditional solution of shared data 
management consists in avoiding the relay of shared data by the 
cache memories: in a conventional manner, a partition logic 
(LPj) is associated with each treatment processor (CPUj) in order 
to differentiate the addresses of the shared data and those of 
the non-shared data so as to direct the first directly toward the 
central memory (RAM) and the second toward the corresponding 
cache memory (MCj) . 

In a first version of the architecture according to the 
invention, the system comprises: 

a special bus for parallel communication of words (BUSD) 
connecting the processors (CPUj) and the central memory (RAM) , 

a partition., logic (LPj) associated with each processor 
(CPUj) and adapted to differentiate the addresses" of the shared 
data and those of the non-shared data in such a manner as to 
transmit the non-shared data on the address communication means 
with their identification, 

a decoding logic (DEC) associated with the central memory 
(RAM) and adapted to receive the addresses with their 
identification and to direct the data into the memory output 
either to the corresponding memory shift register (RDMj) for the 
non-shared data, or to the special word communication bus (BUSD) 
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for. the shared data. 

This solution presents the advantage of being very simple 
in the architectural plan. The presence of the special parallel 
communication bus (BUSD) leads to better performances with 
respect to a solution which consists in utilizing the series, 
connections for transferring not only the blocks of non-shared 
data but also the words of shared data. It should be noted that 
this latter solution may, in some cases, be provided in case of 
low flow of shared data. 

In another version, the system is provided with a special 
bus for parallel communication of words and a special common bus 
for communication of addresses of words (BUS AM) in order to 
transfer the data by the special word bus (BUSD) , and direct the 
non-shared data to the address communication means (which may 
comprise a parallel communication bus where the communication is 
carried out by the series links) . 

■ The presence of a special bus for communication of addresses 
of words permits, in this version, to move back the saturation 
limit of the address communication means, in case of high demand 
for shared data. 

Another version which will be preferred in practice in the 
case in which the address communication means comprises a 
parallel address communication bus (BUSA) consists in providing 
the system with a memory management processor (PGM) associated 
with the memory (RAM) and a snooper processor with a bus (PEj) 
associated with each treatment processor (CPUj) and to the 
corresponding management directory (RGj) . The memory management 
processor (PGM) and each {espion} processor, of structures known 
in themselves , are connected to the address communication bus 
(BUSA) in order respectively to oversee and to treat the 
addresses of blocks transmitted on said bus in such a manner as 
to permit an updating of the central memory (RAM) and of the 
associated cache memory (MCj) in case of detection of an address 
of a block present in the associated directory (RGj) . 

"The memory management processor (PGM) and each snooper 
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processor (PE,) associate status bits of each block of 
information, holding them open as a function of the nature (read 
or write) of the requirements of the block which transitions on 
the bus (BUSA) and assures the coherence of the shared data while 
using these status bits which permit them to force or not write 
a block into the central memory at the moment of the requests on 
the bus (BUSA) . 

In the case referred to previously where the communications 
of addresses are made by the series connections, the management 
of shared data may also be assured in a centralized manner, by 
a memory management processor (PGM) associated with the central 
memory (RAM) and a processor for maintaining the coherence of the 
shared data (PMCj) associated with' each treatment processor 
(CPU,) and with the corresponding management directory (RG-) , each 
coherence maintenance processor being connected to a 
synchronization bus (SYNCHRO) controlled by the memory management 
processor (PGM) , in such a manner as to permit an updating of 
the central memory (RAM) and of the associated cache memories 
(MCj) in- case of detection of an address block, an updating of 
the central memory (RAM) and the cache memories (MCj) at each 
address selection in the complementary shift registers (RDCj) „ 

As before this operation is assured due to the status bits 
associated with each block of information by the processor (PGM) , 

It should be noted that a synchronization bus of the type 
hereinafter defined may, in some cases, be provided in the 
preceding architecture where the address of blocks move on a 
common address bus BUSA. In this case, the snooped processors 
(PEj) are urged by the memory management processor (PGM) via the 
synchronization bus, and this only when they are concerned by the 
transfer. Thus, non-useful access to the cache memories is. 
avoided. The snooper processors become then passive (since 
driven by the processor PGM) and they are designated more by the 
more appropriate expression "coherence maintenance processor" 
according, to the terminology hereinabove utilized. 

Another solution consists in reserving the parallel address 
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communication bus (BUSA) for the transfer of addresses of blocks 
of shared data and using the series links for the transfer of 
blocks of non-shared data. 

Further, the multiprocessor system according to the 
invention lends itself to the regroupings of treatment processors 
on a same series link, in such a manner as to limit the series 
links and the corresponding memory shift registers (RDM,) 
necessary. 

The number of memory shift registers (RDM,) may correspond 
to the number of series links (LSj) , to which case each memory 
shift register (RDM,) is connected in a static manner to a series 
link (LSj) specifically appropriated to said register. 

The number of memory shift registers (RDM,) may also be 
different from that of the series connections (LS.) and in 
particular less,, in which case these registers are connected in 
a dynamic manner to the series links (Lsp through an 
interconnection network. 

As in conventional systems, the central memory (RAM) may 
be divided into 'm' memory banks (RAM^ . .RAM P . . .RAMJ arranged in 
parallel. Each memory shift register (RDM,) is then comprised 
of m elementary registers (RDM,i. . ,RDM,p. . .RDM,m) connected in 
parallel to the corresponding series link (LSj) . However, a 
level of supplementary parallelism and a better electrical or 
optical adaptation of the connection is obtained in a variation 
in which each memory bank RAMp is connected to each processor 
CPU, by a series link from point to point LSjp. 

In order to provide transfer performance at least equal to 
those of conventional systems with a parallel bus, the system 
according to the invention is preferably synchronized by a clock 
of a frequency F at least equal to 100 'megahertz. The memory 
shift registers (RDM,) and processor shift registers (RDP,) may 
very simply be of a type adapted to present a shift frequency at 
least equal to F. 

In the case of- very high frequencies (particularly greater 
than 500 megahertz with current technology) , the registers may 
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be divided into sub-registers of a lower shift frequency, and 
then multiplexed. 

The invention also relates to a multiport series memory 
component, susceptible of equipping the multiprocessor system 
previously described, in , order to simplify the fabrication. 
This component, which may have different applications, is 
constituted by an integrated circuit comprising a random access 
memory (RAM) of a pre-determined width corresponding to a block 
of information (bi) , an assembly of shift registers 
(RDM lr . -RDMj. ..RDMJ , each of a capacity corresponding to the size 
of. the memory, an internal parallel bus (BUSI) connecting the 
access of the memory and the shift registers, a selection logic 
of a shift register (LSR) adapted to validate the connection on 
the. internal bus between the memory and a predetermined shift 
register, and an assembly of external input/ output pins for the 
input of addresses to the memory (RAM) , for the input of 
addresses to the selection program (LSR), for the input and the 
validation of transfer commands in read or write of a block of 
information (bi) between the memory (RAM) and the shift registers 
(RDM,), for the input of a clock signal to each shift register. 
(RPMj) for the input bit by bit of a block of information (bi) 
to each shift register (RDMj) and for the output bit by bit of a 
block of information from each shift register (RDMj) . 

This component may be made parametrable by the adjunction 
of configuration registers (Rq, RC 2 , . . .) permitting particularly, 
a choice of sizes of blocks of information (bj) and of diverse 
modes of operation of the shift registers* 

The invention having been described in its general form, 
is illustrated by the description which follows in reference to 
the accompanying drawings which show without limitation several, 
embodiments. In these drawings, which form an integral part of 
the present description: 

FIGURE 1 is a block diagram of a first embodiment of the 
multiprocessor system- according to the invention; 

FIGURE 2* is a diagram giving the calculated performance 
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curve of this system (A) and, by way of comparison, the 
corresponding curve (B) for a conventional multiprocessor 
architecture with a common bus; 

FIGURES 3, A and 5 are detailed logic schematics of 
functional units of the system of FIGURE 1; 

FIGURE 6 is a block diagram of another embodiment of the 
system ; 

FIGURE 7 is a block diagram of a system of the type as that 
of FIGURE 1, provided with shared data management means; 

FIGURE 8 is a detailed logic diagram of a sub-assembly of 
the system of FIGURE 7; 

FIGURE 9 is a block diagram of a system analogous to that 
of FIGURE 7 with a variation in the shared data management means ; 

FIGURE 10 is a block diagram of an analogous system, 
provided with a different .shared data management means; 

FIGURES 11/ 12a, 12b, 12c, 12d, 13, 14, 15, 16 17 are 
detailed logic schematics of the functional units of the 
. processor system of FIGURE 10 ; 

FIGURE 18 is a block diagram of a system of the type as 
that of FIGURE 6, provided with means for shared data management; 

FIGURE 19 is a simplified block diagram of a variation of 
the system, in which several central units share the same series 
link; 

FIGURE 20a is a block diagram of a preferred embodiment, 
in which the central memory is organized into several memory 
banks , 

FIGURE 20b is a variation of the architecture shown in 
FIGURE 2 OA; 

FIGURES 21a and 21b schematically show another structure 
of the memory RAM susceptible of equipping said system; and 

FIGURE 22 is a block diagram showing the structure of a 
multiport series memory component, capable of equipping the 
system. 

• The apparatus presented -in the form of a block diagram in 
FIGURE 1 is a multiprocessor system having n treatment processors 
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CPU 1 ...CPU J ...CPU„. Shown in this figure are two treatment 
processors CPU1 and CPU, with their associated logic. Each of 
these treatment processors is of a conventional type, for example 
••Motorola 68020" or "Intel 80386"... and with peripheral 
interfaces and being provided with a virtual memory device. 

The apparatus comprises a central random access memory RAM 
provided in a conventional manner from integrated memory 
circuits: in particular dynamic ram "INTEL," "NEC," 
"TOSHIBA"... of 2S6 Kbits, l Mbits, 4 Mbits... depending upon the 
application. This memory is organized in blocks of information 
b 0 . . . b,. . . of predetermined size t (usually 256 bits per 2 Kbits) 
and the access front of said memory corresponds to the size of 
one block. 

The central memory is connected in parallel to n shift 
registers RDM t . . . RDM,... RDM„ called memory registers, each 
memory register having the size t of one block of information. 
Each of these registers is provided from high speed technology 
("ASGA") , one latch able to be loaded or unloaded in one cycle 
of the central memory RAM. The number n of registers is equal 
to the number of processors CPU,. 

A cache memory MC, is associated in a known manner with each 
processor CPU,. Each cache memory comprises a conventional high 
speed random access memory, of low capacity with respect to the 
central memory RAM. A directory RG, and a management processor 
PG, are, in a traditional manner, connected to the cache memory 
and to the treatment processor for managing the information 
flowing in the cache memory. 

Further, in the system of the invention, a shift register 
RDP,- called a processor register . is connected by its parallel 
port to each cache memory MC,. Each processor register RDP, is 
of a size corresponding to that of a block bi and of a structure 
similar to that of the memory registers RDM,. 

Each memory register RDM, is connected by its series port 
to the -series port of a processor register RDP, by a series link 
LS r Examples of the provision of this series link, which may 

14 



comprise a bidirectional link or two unidirectional links, are 
illustrated in FIGURES 4 and 5. The control of the transfer of 
the blocks b, between corresponding registers RDM, and RDP, is 
assured by the transfer logic TFR, and TFR» which are associated 
in a symmetrical manner with the memory register RDM, and with 
the processor register RDP,. "one embodiment of these transfer 
logics (known in themselves) is detailed in FIGURE 3. 

The central memory RAM, memory shift registers RDMj. . .RDM n , 
and associated transfer logics TFR 1# . .TFR^ comprise a functional 
assembly called a "multiport series memory" MMS. The treatment 
processor assembly CPU j; cache memory MC,, management directory 
for the cache RGj, management processor of the cache PGj, shift 
register processor RDP, and the associated transfer logic TFR'. 
constitute a "functional" assembly called a "central unit" UC,/ 

Further, the system comprises means for communication of 
addresses of blocks of the processors CPU,, to the central memory 
RAM, comprising in the example a common parallel communication 
bus BUSA on which are connected the processors CPU, (though their 
management processor PG,) and the central memory RAM. 

Access to the bus BUSA is controlled in a conventional 
manner by a bus arbitrator AB. 

The general operation of the architecture hereinabove 
defined is as follows: 

A processor CPUj carries out a program proper comprising 
instructions, which are found in the form of words in the central 
memory RAM with extracts in the associated cache memory MC,. On 
these instructions of the program, the processor CPU, is brought 
either to read the words of data which themselves are found in 
the central memory RAM or in the cache memory MCj in the form of 
extracts, or to write the words of data in the central memory RAM 
and in the cache memory MC,. 

Each operation of a processor CPU, called . a "request") 
requires the furnishing of the address adr of the word concerned, 
the nature' r,w of the operation (read, write), and the data 
exchange of the word concerned. 
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Each request of a word activates the processor PGj which 
then consults in a conventional manner the directory of the cache 
RGj which indicates if the block bi containing the word concerned 
is present in the cache memory MCj at, in some cases, the frame 
of the block in the cache memory where .the block searched for is 

4 f 

found. 

If the block bi containing the word concerned is in the 
cache memory MCj, then in case of reading, this word is read in 
said cache memory and sent to the processor CPUj. In case of 
writing, the word furnished by the processor CPUj is written in 
the cache memory: the memory transaction is terminated. 

If the block containing the word concerned is not in the 
cache memory MCj, then a reading of the block bi into the central 
memory RAM is necessary. Two cases may be produced. 
First Case 

The cache memory MCj makes use of at least one free block, 
determined by the processor PG, with the help of status bits 
associated with each input of the directory RG,. . In this case, 
the processor • PGj is requesting, in a conventional manner, the 
bus BUSA while referring its demand to the bus arbitrator AB. 
This latter reconciles, in turn, the bus BUSA to the processor 
PGj which agrees while writing to the central memory RAM, the 
block read in the memory being loaded into the register 'RDMj, 
identified by the origin j of the call. The end of the reading 
cycle translates by the liberation of the bus BUSA and the 
activation of the transfer with the series link LSj permitting 
the transfer of the contents of the memory register RDMj in the 
processor register RDP,. The end of the transfer activates the 
writing in the cache memory MC, of the contents of the processor 
register in the placement of the block reserved for this effect- 
and the transaction may terminate as before. 
Second case 

The cache memory MCj is not arranged for free placement 
then, by a conventional algorithm, a placement of the cache is 
made a candidate for receiving the block requested. Two 
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situations may be encountered: 

The block contained in the candidate position has not been 
modified since its installation: it is simply eliminated while 
freeing the frame of the block by a simple writing of a status 
bit in the directory (RGj) and the transaction may proceed as 
before* 

The block contained in' the. candidate position has been 
modified and an operation of the central memory is required. 
For this fact, the management processor PGj transfers the 
candidate block to the processor register RDPj, activates the 
transfer of the processor register RDP. to the memory register 
RDM^, then requests the common bus BUSA while referring its 
request to the arbitrator AB. When the arbitrator admits the bus 
to the management processor PGj, the latter activates a write 
command which has the effect of transferring the contents of the 
memory register RDMj to its position in the central memory RAM, 
The operation of the memory RAM is terminated and the transaction 
may continue as before. 

Thus, in the apparatus of the invention, the exchanges 
between the treatment processors CPUj and their cache memory MCj 
and associated logics RGj, PGj, are carried out in a conventional 
manner. By contrast, the transfers of blocks between central 
memory RAM and cache memories MCj proceed, no longer by a common 
parallel bus, but by the series links LSj dedicated to each 
treatment processor CPU,, the common bus BUSA only serving for 
the. transfer of the addresses and having thus a considerably 
reduced traffic. 

It is known that, for conventional common bus architectures, 
a modelization studied by "PATEL" ("Analysis of Multiprocessors 
With Private Cache") JANAK H. PATEL, IEEE Transactions on 
-Computers,, Vol C31, No. 4, April, 1982) has led to the following 
approximate formula giving the efficiency U as a function of the 
number of processors present; 

1 

u 

1 + m (W+tf ) 
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where the efficiency U is the average rate of utilization of 
each treatment processor, 

m is the probability for a treatment processor to make a 
memory request, not present in its the cache memory (this 
probability m = .Pa is proportional to. the. probability of the 
absence Pa of the information in the cache memory and a factor oC 
which is a function of the power of the treatment processor 
expressed as a percentage of the memory requests) : 

W is the average waiting time of the common bus, which is 
a function of the number of processors , 

tf is the transfer time of a block from the central memory 
to a cache memory. 

The hypotheses from which this formula has been established 
show that it is applicable to the architecture according to the 
invention, with a level of approximation comparable to the level 
of approximation of the formula for conventional common bus 
architecture . 

It is thus possible to compare the performances of the 
two types of architecture while assuming that the components 
common to the two architectures are of identical characteris- 
tics. 

FIGURE 2 gives the curves obtained of the efficiency U as 
a function of the number n of processors for the following 
parameters, the parameters common to the two systems being 
identical, and all of a usual value: 

size of block b, 64 octets, 

size of word for the parallel transfer on common bus = 4 
octets, 

central memory RAM access time = 100 nanoseconds, 
- cycle time of the bus BUSA = 50 nanoseconds, 
series transfer frequency = 500 Mhz, 

probability of absence Pa = 0.005 (cache memory of 16 
octets) , 

power factor of the processors: o< = 0.5. 

It is established by comparing the curves A (architecture 
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of the invention) and B (conventional architecture) that the 
architecture according to the invention has an efficiency 
clearly superior to the conventional architecture. The ar- 
chitecture of the invention permits putting in place a number 
of processors very superior to the conventional common bus ar- 
chitecture which, in practice,' cannot exceed about ten proces- 
sors. For example, in the conventional case, an efficiency of 
0.75 is obtained at the tenth processor, whereas it is 
obtained for more than 80 processors in the case of the 
invention. 

FIGURE 3. presents an embodiment of a transfer logic TFRj 
or TFR'j permitting the transfer of a block bi of information 
from a memory register RDM, to a processor register RDP, (the 
inverse transfer is assured by the symmetric means not shown 
in this figure) . Each logic TFR, or TFR'j comprises a part for 
control of the output TFREj and TFRE'j, and a part , for control 
of the reception TFRRj and TFRR'j which are activated in a 
crossed manner (output TFREj activated in synchronism with the 
reception TFRR'j) . The system comprises a clock generator H 
the frequency of which fixes the speed of transmission and 
furnishes the clock signal h at the output part TFREj and at 
the reception part TFRR'j. 

In the output part TFREj a countdown register DC receiv- 
ing at its load input loadj the reading signal r from the 
management processor PGj permits allowing to pass t + l pulses 
of the clock h t hrough a logic port ET1 controlled by a pass 
to zero signal "borrow", the output of this port ET1 being 
connected to the input of the "down" counting of the countdown 
device DC and to the shift input shiftl of the memory register 
RDM,. .... 

In the reception part TFRR'j, a flip flop B is connected 
by its data input D to the series output of the processor 
register RDP,, the clock input elk of this flip flop being 
connected to the clock H for receiving the signal h. An 
initialization signal "init" furnished by the management 



19 



processor P Gj is connected to the input s of the flip flop b 
and to the loading input lolda of the processor register RDP-. 
The ° UtpUt Q of the fli P transmits a control signal S5= 

transfer to the logic port ET2, permitting the clock signal h 
to pass toward the shift input shift2 of the processor 
register RDP,. This control signal is also delivered to the 
management processor P Gj for indicating the end of transfer of 
the block. 

The operation of the assembly is as follows: the manage- 
ment processor P Gj , after having obtained access to the central 
memory RAM via the bus BUSA, carried out its reading of the 
block bi while furnishing the address of the block concerned 
and the reading signal r. This signal releases the activation 
of the output part TFREj : . the final front of the reading 
signal r provokes the loading of the block bi into the memory 
register RDM, while activating the signal ISadl and the loading 
of the value t + i, corresponding to the size in bits of the 
block bi plus one supplementary bit called "start" , into the 
countdown register DC by the signal lbad 2 . This has for 
effect to reset to 1 the signal boTro^ and authorize the 
transfer clock H to furnish, through the logic port ET1 
conditioned by this borrow signal, t + i clock pulses h: 
these pulses have for effect to shift by the input shiftl t + 
1 bits from the memory register RDM, and to cause to wait for 
the value 0 from the down input to the countdown device DC* 
the signal borrow is reset to zero and locks the operation of 
the output part TFREj. 

Thus, the series connection LSj , initially having the 
logic rest state 1, transfers the start bit 0, then the t bits 
of the block, bi,- and then returns to the logic rest state 1 
the latter bit sent being the value 1 forced on the series 
input of the memory register RDM,. 

As a preliminary to the reading demand, the management 
processor fg, hasjnitialized the receiving part TFRR'. while 
activating the init signal which has the effect of loading the 
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proce ssor register RDPj with t bits to l through the input 
load3 and to set the output Q of the flip flop B to the logic 
state l by the input s. This, output Q then validates the 
logic port ET2 which allows the clock signal h to pass toward 
the input shift2 of the processor register RDP1. At each 
clock pulse this processor register furnishes a bit on its 
series output which is stored in the flip flop B. The first 
bit 0 which is presented has the effect of setting to zero the 
output Q of the flip flop b and locking the clock signal h on 
the gate ET2. This first bit 0 being the start bit "which 
precedes the block hi, this .latter is then trapped in the 
processor register RDP, when the management processor PG, is 
notified of the change of state of the flip flop B by the end> 
transfer signal: the management processor PG, need, only come 
to read this block bi on the parallel output of the register 
RDPj. 

The writing of a block bi to the central memory RAH 
reguires the presence of a logic TFRE'j, identical to the logic 
TFREj, associated with the processor register RDPj, and a logic 
TFRRj, identical to the logic TFRR'j, associated with the 
memory register RDM,. In this case, the signal init of the 
logic TFRRj is connected to the writing signal w: the 
liberation 6f the memory register RDM, automatically rearms the 
receive logic TFRRj. 

This embodiment of the transfer control logic is only one 
possible example: the transmitter register may be in per- 
manent shifting itself, and the receiver register activated 
for t pulses of the clock on detection of the start bit at the 
beginning of transfer. 

The clock H. may be -connected to two registers, or two 
local independent clocks may be used, the synchronization 
being obtained in a conventional manner by an introduction of 
synchronization. 

.The system shown in FIGURE 4 comprises a divided memory 
shift register RDM, and RDMj, a divided processor shift 
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register RDP1, and RDP2j, two unidirectional series links LSI. 
and LS2 jf one connecting the memory register RDM1 to the 
processor register RDPj in such a manner, as to transmit the 
contents of the first to the second, the other connecting the 
memory register RDM2j to the processor register RDP2* in. such 
a manner as to transmit the contents of the second to the 
first, and the associated logics for the control of the trans- 
fer: TFRElj for RDM1; TFRR2j for RDM2 ; TFRE2j for RDP§. ; TFRRlj 
for RDPlj. 

For reading a block of information bi into the central 
memory RAM, the management processor PGj initializes by the 
signal init the logic TFRRlj associated with the processor 
register RDPjl , then activates its demand to read to the memory 
RAM by the read signal r. This signal activates the logic 
TFRElj associated with the memory register RDMlj : this assures 
the transfer on the link LSI, of the block bi of information. 
The end of the transfer is detected by the logic TFRRlj 
associated with the processor register RDPlj which notifies 
the managem ent processor PG, of the arrival of the block bi by 
the signal end -transfer. The management processor; PGj thus 
transfers the contents of the processor register RDPlj into 
the cache memory MCp 

. For writing a memory block bi, the management processor 
PGj loads the processor register RDP2 with the block b; con- 
cerned extracted from the cache memory MC jf which activates 
the transfer of this block on the link LS2 S . The transfer 
logic TFRR2j associated with the memory register RDM2j assures 
the good reception of this block. The management processor 
PGj is notified of the end of transfer by the change of state 
of the signal borrow issued from the transmission logic 
TFRE2j. The management processor PGj then carries out its 
writing request which becomes effective at the time of 
activation of the writing signal w. This has the effect of 
transferring the contents of the register RDM2j into the 
central memory RAM and to reinitiate for a next transfer the 
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logic TFRR2j. 

This system authorizes a simultaneous transfer of blocks 
in two directions and permits treating more rapidly the 
defects of blocks bi in the cache memory MCj when the latter 
is saturated. It authorizes also the operation of a conveji- 
tional mechanism for anticipation of the reading of blocks. 

In another embodiment shown in FIGURE 5, the connection 
LSj comprises a single bidirectional link provided at each 
extremity with a validation logic LV1 and LV2 constituted by 
a logic gate with two open connector inputs OC1 and 0C2, one 
of the inputs being connected to the series output of the 
memory register RDMj for the gate 0C1 and the processor 
register RDPj for the gate 0C2, the other input being con- 
nected to the output Q of a control flip flop BCl and BC2; 
each of these is connected by its inputs S and R to the 
transfer logic TFR for the flip flop BD1 and TFR 1 for the flip 
flop BD2. 

Readings and writings are carried out in an exclusive 
manner, at the sole initiative of the management processor PGy. 

A reading memory activates the reading signal r which 
causes the setting to 1 of the flip flop BCl by its input S, 
the resetting to zero being controlled, on the input R by the 
transfer logic TFR at the end of the transfer of the block. 

A writing memory releases a mechanism identical to the 
validation logic LV2. 

Other combinations of registers/links are possible, and 
in the case of a bidirectional link, bidirectional shift 
registers may in particular be used, receiving a signal of the 
direction of transfer. This solution leads to the use of 
shift registers more cpmplex in logic, therefor a priori less 
efficient in speed of transfer. 

The speed of transfer coming to be very high, the shift 
registers RDMj and RDPj, their associated control logic TFR and 
TFR 1 , the validation logics LV1 and LV2, are s lected from a 
rapid technology (ECL, ASGA) , and synchronized by a clock of 

23 



a frequency F at least equal to 100 MHz. 

Another solution with multiplexed registers shown in 
FIGURE 21 permits, as will be understood below, considerably 
reducing the quantity of efficient, and therefore costly logic 
necessary. 

The multiprocessor of FIGURE l was provided at the same 
time with a common communications bus for block addresses and 
data transfer series links. FIGURE 6 shows, as a variation, 
a multiprocessor system of the same general principle, but in 
which data and addresses transfer by the series connections, 
in the absence of a common bus. 

This system comprises, besides the memory registers RDM,, 
complementary shift registers RDCj able to store the addresses 
of blocks called for and controlled by a logic of the type 
TFRj. Further, an access management arbitrator ABM is con- 
nected to the central memory RAM and to the complementary 
registers RDCj by their parallel output. Each logic TFRj is 
connected to this arbitrator ABM of conventional structure. 
The management processor PGj of each cache memory MCj is 
connected to one part of the parallel input of the processor 
register RDPj, in order to have access thereto in writing. 

For reading a block b 5 in central memory RAM, the manage- 
ment processor PGj places the address of the block called for 
and the nature of the request (by a prefix bit: l^read, 
0=*write) in the part of the register processor RDP, to which 
it is accessible, which has the effect of initializing the 
transfer of this information. The transfer logic TFRj detects 
the end of the transfer on the complementary register RDC. and 
activates a demand operation toward the arbitrator ABM. This 
is loaded for serializing and treating the read requests of 
the block in the central memory RAM while going to read the 
address of the block requested in the complementary register 
RDCj corresponding to the transfer logic selected by the 
arbitrator ABM, then in going to read the block in the central 
memory RAM which will then be loaded into the memory register 



24 



RDMj and transmitted as before. 

For writing a block in the central memory RAM, the 
management processor PGj chains the transmission of the address 
then of the block to be written through the processor register 
RDPj. The, complementary register RDCj receives at once the 
address and the nature of the request. 

The transfer logic TFRj analyzes this request and vali- 
dates the reception of the block in the memory register RDMj 
from the fact of the nature of the request (write) . The 
transfer logic TFRj is notified of the end of the transfer of 
the block b { and then transmits its request for service to the 
arbitrator ABM. This request is treated , in turn, by the 
arbitrator which activates the writing of the block bi into 
memory. 

Further, the multiprocessor system shown in FIGURE 7 
comprises means for management of the shared data, permitting 
treating, in a static manner, the classical problem of 
maintaining the coherence of the shared data. This system 
comprises the resources of the system of FIGURE 1 {with the 
same reference characters) with the following logic and 
supplementary resources: 

A special parallel communication bus for words BUSD 
connects the processors CPUj and the central memory RAM. A 
partition logic LPj is associated with each processor CPUj. 
Each logic LP, is constituted in a conventional manner by an 
assembly of coupled register-comparators connected in parallel 
on the address bus adr of the processor CPU jf in order to 
provide a division of the memory space of the central memory 
RAM in the zone of non-shared data and shared data, said logic 
LPj delivering to this effect. a signal p (indicating the nature 
of the data, shared or not). A decoding logic DEC is assoc- 
iated with the central memory RAM, itself arranged to be 
controlled by a writing by word or by block by said logic DEC. 

The decoding logic DEC is detailed in FIGURE 8 and com- 
prises a decoder DECL, receiving on its data input the address 
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portion word adrm of the address adr, and connected through 
its validation input to the output of a logic gate ET3, each 
output i of said decoder being connected to a validation 
buffer of the type BFSj. The logic gate ET3 receives on its 
inputs the signal p and the inverted signal r . A decoder DECE 
is connected through its validation input to the output of a 
logic gate ET4, its outputs being connected to an assembly of 
logic gates 0U1, of a number equal to the number of words in 
a block. The logic gate ET4 receives on its inputs the signal 
p and the inverted signal w. The output of the gate ET4 is 
also connected to an assembly of . validation input buffers 
BFE 1# BFEi . • • * The central memory RAM may be controlled in 
writing by word. Each clipped word thus defined has its 
writing control input w t . The output of each logic gate OUli 
is connected to the input wl of each clipped word of the 
central memory RAM. 

FIGURE 8 shows further the detail of the addressing of 
the memory registers RDMj, which comprises in the first place 
a decoder DECEB connected through its data input to the common 
bus BUSA, in order to receive the number i of the processor 
concerned by the request of the central unit UCj. This decoder 
DECEB is connected through its validation input to the output 
of a logic gate ET5 and through its outputs i, 2... i to the 

validation buffers BV1, BVj The logic gate ET5 receives 

on its inputs the signal p and the inverted signal w. In the 
same manner, a decoder DECLB is connected through its data 
input to the field j of the common bus BUSA and through its 

outputs l f 2 j of this decoder DECLB are connected to the 

loading inputs Idl, ldj of the memory shift registers RDMj. 
The logic gate ET6 receives on JLts inputs the signal p and the 
inverted signal r. 

The operation of the system is as follows: at each 
reference memory, the processor CPUj provides an address on 
its address bus adr, and the nature of the request : reading 
r or writing w. It waits for a datum in case of reading and 
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furnishes a datum in case of writing. The address adr 
traverses the partition logic LPj, which indicates, by the 
signal p, if the address adr belongs to a zone, of non-shared 
data (p=0) or of shared data (p=l) . In the first case, the 
request is directed to ,the management processor PG. and is 
treated according to the mode of operation described in 
reference to FIGURE 1. In the second case, the request is 
directly sent through the common bus BUSA. The bus address 
adr permits supplementary address streams permitting iden- 
tification of the word concerned: the address adr is com- 
prised of a block address part adrb . and a word address part 
adrm. Thus, after agreement of the bus arbitrator AB, the 
central memory RAM receives either a block transaction request 
(p»0) and in this case, only the block part adrb of the 
address adr is significant, or a word transaction request 
(p=l) and, in this case, the whole address adr (block adrb and 
word adrm) is significant. 

In case of a reading block, p=0 and r=0, the logic gate 
ET* validates the decoder DECLB which delivers a loading signal 
LDj to the shift register RDMj, permitting loading into the 
latter the block read into the central memory RAM with the 
address adrb by the reading signal r. 

In the case of a writing block, p=0 and w=0, the logic 
gate ET5 validates the decoder DECEB which delivers a valida- 
tion signal to the buffer BVj, permitting the contents of this 
register to be presented to the central memory RAM and to thus 
be written to the address adrb, the output of the logic gate 
ET5 providing the block writing signal. This latter is 
broadcasted on the writing inputs wl, w 4 , . . . to the clipped 
words of the central memory RAM -across the logic gates 0U1 4 . - 
In case of word reading, p=l and r=0, the logic gate ET3 
validates the decoder DFCL which delivers a validation signal 
to the buffer BFSj, permitting the requested word (address 
adrm in the block adrb) of which the reading is assured by the 
signal r, to be directed toward the special communication bus 
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BUSD. This word is recovered directly by the processor CPUj 
on its data input data. 

In case of writting word, p=l and w=0, the logic gate ET4 
validates the decoder DECE which furnishes on its output { a 
signal directed across the Logic gate OUli toward the writing 
input w, of the clipped word of the central memory RAM con- 
cerned. This signal present at the input Wj permits writing 
in this sole clipped word the word furnished by the processor 
CPUj on the data bus BUSD. The contents of this bus are 
presented in parallel on all the clipped words of the central 
memory RAM, due to an activation of the buffers BFEj by the 
signal emitted by the logic gate ET4. 

An essential characteristic of the architecture of the 
invention is to present a minimum load of requests on the 
common bus BUS A. In the architecture shown schematically in 
FIGURE 7 , the common bus BUSA is driven by the block addresses 
and the word addresses. The frequency of the word address 
requests is a function of the rate of shared data and may lead 
to a saturation -of the common bus BUSA. 

FIGURE 9 shows a variation of a solution for reducing 
this load. The system provided comprises, in addition to the 
resources of FIGURE 7, a bus BUSAM for the word addresses, an 
arbitrator AB* for arbitrating the access conflicts to the bus 
BUSAM, an arbitrator ABM for arbitrating the access conflicts 
in the origin of the busses BUSA and BUSAM, and connected to 
a multiplexer MUX which in turn is connected through its 
inputs to the two busses BUSA and BUSAM. 

The operation of this system is as follows: 

If the request concerns non-shared data (p=0) , any lack 
of information generates a memory request of the block type 
which transfers through the common bus BUSA. 

If the request relates to shared data (p=l) , the request 
is directed toward the common bus BUSAM. Thus, the central 
memory RAM may receive simultaneous requests on the two busses 
BUSA and BUSAM, which must then be arbitrated. The arbitrator 

28 . 



ABM allows , in a conventional manner, access to the central 
memory RAM by one of the two requests and reconstructs the 
signal p from the origin of the request (p=0 for BUSA, p=l for 
BUS AM) . The signal p thus controls both the multiplexer MUX 
which allows the signals from the bus concerned to pass 
through the request,, and the decoding logic DEC: one is thus 
situated as in the preceding system. 

It will be noted that the load is shifted from the common 
bus to the central memory RAM, since the rate of request at 
the level of this latter remains the same, and its cycle time 
is of the same order of magnitude or even greater than that of 
the bus cycle. 

This solution is therefore only of "interest if the 
central memory RAM comprises central independent memory banks 
organized according to the description given later with 
reference to FIGURE 20: several transactions may in this 
case, if they affect different memory banks, take place 
simultaneously. 

FIGURE 10 presents a block diagram of an embodiment of * 
the architecture according to the invention, in which the 
problem of shared data is treated in a dynamic fashion. To 
this end, the system conforming to this embodiment comprises 
an snooper processor with a bus PE j# coupled to a management 
processor with a parallel link PGPj. A management processor 
with a series link PGSj is connected to a snooper processor 
PEj by a waiting line FIFOj. A management processor for 
requests from the central unit PGUj is connected, first to the 
treatment processor CPUj, and to the parallel link management 
processors PGPj and the series link manager PGSj. The logic 
- corresponding to the management processor -PGj of each cache 
memory is in this embodiment split into various processors 
presented hereabove. The access to the cache memory MCj and 
to its directory RGj is controlled by a management processor 
with a directory and a cache PGRj. 

Finally, a management proc ssor PGM of the central memory 
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RAM is connected to the bus BUSA and to the central memory 
RAM, and to its shift registers RDM,. 

The functioning of the assembly is as follows: 
Each transaction on the common bus BUSA corresponds to a 
reading or writing request of the block b,. The snooper 
processors of the bus PE, are activated by each reading request 
of the block of data. This operation provided in the same 
cycle through all of the snooper processors is going to permit 
protecting the uniqueness of the value of the shared data. 
The snooper processor PE, arranges an access to the directory 
RGj. The function of the application used for the management 
of the cache memory MCj is in the described embodiment of the 
direct application type. Each element of the directory is a 
descriptor of the block which contains a "tag" field (address 
of the block) , conventional status bits of the block: a 
validation bit v and a modification bit m and two supple- 
mentary bits a for noting that the block is known by the cache 
memory but still in the course of transfer on the series link, 
f for indicating that the block is in the waiting line FIFO 
and thus avoiding its being placed several times. 

The management processor memory PGM arranges a waiting 
line AFIFO of addresses of blocks bi and of addresses of 
processors, accessible in an associative manner, and a. status 
directory of blocks constituted by 2 bits per block ro arid rw 
indicating the possible states of the following block: 
ro = rw * o: block not yet broadcast, 
ro - 1; rw - 0: block already broadcast in reading: 
one or several copies of this block being found in the cache 
memories . 

- The - evolution of block status bits is the following, 
different according to the nature of the request of the treat- 
ment processor CPU,: 

If the processor CPU, makes a request for reading 
non-shared data (program space or explicitly non-shared data) : 
the block is marked already broadcast in reading (ro * i; rw 
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- 0) central memory part at the time of transfer of said block 
from the central memory RAM to the register RDMj and marked 
non-modified (m = 0) , cache memory part in the same cycle of 
operation of the directory RGj of the cache memory (valid 
block) . The snoopers have not reacted to the request on the 
common bus (the request having been made with the indication 
"shared" or "non-shared") • 

If the processor CPU, makes a request for reading 
data (a priori shared) , the common bus BUS A is occupied for 
the time of passage of the information of addresses and of the 
type requested, the time of their treatment by the processor 
PGM and the snoopers of the common bus PEj. In the central 
memory RAM, this block may be: 

1. Not yet broadcast: ro « rw - o. It is then 
transmitted to the central unit UC, and takes the non-modified 
state. 

2. Already broadcast in reading: ro « 1; rw = 0. 
It is then transmitted to the central unit UC,. Its state 
does not change. 

3. Already broadcast in writing: ro = 0; rw - 1. 
The open copy of this block is found in a cache memory MCi. 
The snooper processor associated with this cache memory has 
noted the request of the central unit UC, at the time of 
passage of the address on the common bus and has attempted its 
transfer to the central memory RAM as soon as possible on the 
series link LSj. While waiting its effective transfer, the 
memory management processor PGM places the request while 
waiting in the associative waiting line which comprises a 
number of elements equal to the number of processors. 

At the -time bf the reading request on the - common bus 
BUSA, all the snooper processors PEj have reacted by consult- 
ing the directory RG S associated with their cache memory MCj. 
The common bus BUSA is only freed when all the snooper 
processors PE| have had their access to the management direc- 
tory RGj, which assures the same state of the block in the 
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entir system. The processor which has the current copy in 
its cache memory carries out, as soon as its series link is 
free, the transfer of this block into the register RDM, and 
makes a writing request for the block on the common bus which 
will have the effect to free the request while waiting in the 
associative file AFIFO and. carrying out the operation of the 
status bits of the block. 

. The operation of the block only needs a writing to the 
central memory RAM without activation of the snooper. 

If the processor CPUj requests the writing of a datum in 
a block present in its cache memory MCj with a non-modified 
state, an informative writing request must be emitted on the 
common bus BUSA as it is possible that other cache memories 
MC t have this block with the same status . These other memories 
must be informed of the change of state. To this end, all the 
snooper processors PE, (activated by the informative writing 
broadcast on the common bus BUSA) consult their management 
directory and invalidate this block, while the central memory 
notes in the Scuae time the change of state of this block as 
well as the parallel management processor PGPj, in the manage- 
ment directory RGj. The liberation of the common bus BUSA by 
all the snooper processors and the central memory RAM permits 
the processor CPUj to provide the writing in its cache memory 
MCj, the broadcasting of the status bit of the management 
directory RGj having been carried out. 

If one central unit is awaiting access to the bus BUSA 
for the same request on the same block, its request is 
transformed in simple writing and then follows the protocol of 
the request of the block in writing. 

If the processor CPUj requests writing ..of a datum in a 
block absent from the cache memory MCj, this block is read 
into the central memory RAM and brought into the cache memory 
MCj in order that the writing will be carried out effectively. 
In the central memory' RAM, this block may be: 

1. Not yet broadcast: ro = rw = 0. The block is 
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then emitted on the series connection LSj to the cache memory 
MCj. It takes the states ro = 0; rw = 1 in the central memory 
and the modified state (m=l) in the cache memory, 

2. Already broadcast in writing: ro = l; rw = 0. 
The block, is sent on the series connection LS. to the cache 
memory MCj. It takes the states ro * 0, rw - 1 in the central 
memory, and the modified state (m m l) in the cache memory. 
At the time of a request on the common bus BUSA, the (espion) 
processors PEj have noted the request and invalidated this 
number in the block in their cache memory MCj. 

3. Already broadcast in writing: ro « Q; rw = 1. 
The request is placed in the associative waiting line AFIFO 
and the common bus BUSA is liberated. 

The snooper processor PEj of the cache memory Mq, winner 
of the current copy/ activates as soon as possible the 
transfer of the block requested from its cache memory Mq to 
the central memory RAM. This block is then invalidated in the 
cache memory Mq. 

The central unit UCj makes a writing request to publish 
the block in the two following cases: 

a) the cache memory is saturated and the purging of a 
block requires sending this block into the central memory, 

b) a central unit uq is awaiting a block the only 
current copy of which is found in the cache memory MC r The 
snooper processor notes the request and provides as soon as 
possible the purging of this block. 

The central memory side RAM, each request for a writing 
operation entails a consultation with the associative waiting 
line AFIFO, and in the case of discovery of a central unit uq 
awaiting this block, the- loading of this block into the shift 
register RDM 4 and the publishing of the status bits cor- 
responding to this block. This type of writing request does 
not drive the {espion} processors. 

. The dir ctory management processor PGR, which is added in 
this embodiment, permits the execution of the algorithm men- 
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tioned above, while coordinating the accesses to the directory 
of the cache memory MCj which receives the requests of three 
asynchronous functional units: 

1. The treatment processor CPUj, in order to read 
the instructions of the program during execution and to read 
or write the data manipulated by this program, 

2. The common bus BUSA, in order to maintain the 
coherence of the data in the cache memory MCj, 

3. The series link LSj, in order to load/un-load a 
block of information from/to the central memory RAM. 

Each of these requests arrives at the management direc- 
tory RGj of the cache memory. The serialization of these 
accesses on said management directory permits assuring the 
good operation of the algorithm mentioned above with coherence 
of information in the cache memories. Thus there is . obtained 
a strong coupling of the requests at the level of the manage- 
ment directory RGj but the synchronization which must exist at 
the level of the treatment of these requests is sufficiently 
low to consider an asynchronous functioning of the treatment 
logic for these requests, which leads to the following 
functional segmentation: 

The interface of each processor CPUj and of its auxil- 
iaries (PGSj, PGUj) with the common bus BUSA is composed of two 
parts having a mutually exclusive functioning: the management 
processor for the parallel link PGPj is loaded to request of 
the common bus BUSA to the. request of the request management 
processor PGUj or of the management series link PGSj and to 
control the common bus BUSA in writing. The snooper pro- 
cessor . of the bus PEj assures the snooping function, which 
returns to control the common bus BUSA in reading . - It 
frequently reaches the directory RGj of the cache memory MCj. 

The management processor of the series link PGSj manages 
the interface with the series link LSj. It assures the loading 
and unloading of blocks of information bi upon request of the 
request management processor PGUj and of the snooper processor 
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of bus PEj. It reaches less frequently the cache memory MCj 
and the corresponding management directory RGj. 

The request management processor PGUj assures the follow- 
ing of the requests issued from the processor CPUj. It reaches 
yery frequently - into the cache memory MC, and the management 
directory RGj. This, interface includes possible "MMU" logic 
(memory management unit) usually associated with the treatment 
processor CPUj. 

The management processor PGRj of the management directory 
RGj is the arbitrator charged with granting access to the cache 
memory MCj. 

FIGURES 11, 12, 13, 14, 15, 16 and 17 show by way of 
examples the embodiments of the various functional units of 
the system of FIGURE 10. The designations of signals or 
inputs and outputs of these units are chosen in a conventional 
manner. The signals of the same functionality which are 
generated in each functional unit from a base signal will be 
designated by the same reference, for example: dnp - non- 
shared data, dl = reading request, maj - updating, de - demand 
writing, ej = informative writing. The system has several 
processors and the indicia , j l used heretofore refers to a 
current processor and its auxiliaries; for alleviating the 
description, this indicia has been omitted in these figures 
and it will be understood that the description which follows 
is directed at each of the functional units which are con- 
nected with each treatment processor. Further, the signals 
denoted x_YZ define the name and the origin of the signal in 
the case where YZ - RG, MC, UC, and the source and the 
destination of the signal in the other cases, with Y and Z 
representing: U * PGU, R = PGR, P = PGP or PE, S = PGS. 

The cache memory MC shown in FIGURE 11 has, for example, 
a capacity of 16 K . It is organized in 16 modules of high 
speed read write memory of 1 k MCO, . . .MC^, each accessible 
on a front of 4 octets: the address bus of the cache memory 
MC "(noted adr_MC) comprises a partial block address adr_block 
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and a partial word address in the block adr_mot. The address 
bus adr_MC comprises 14 lines, permitting addressing the 161c 
of the cache memory MC. The partial adr_bloc comprises 8 
lines permitting addressing the 256 block sites in the cache 
memory, and the adrjnot part 6 lines permitting addressing a 
word in the block the size of which is, in the example, 64 
octets. 

The address part adrjbloc is connected to the adress 
input of each of the memory modules MC Q ...MC 15 . The word 
address part adr_mot is connected to the input of two decod- 
ers DECO and DECl (only the 4 significant bits of the address 
bus adr_mot are used : the address is an octet address and 
the cache has an access unit which is a word of 4 octets) . 
The read signal r_MC is delivered to each of the read inputs 
of the memory modules MC Q ...MC 15 and to one of the inputs 
of a logic gate 0U1. The other input of this logic gate 0U1 
receives the inverted signal bloc. The writing signal w_MC 
is delivered to one of the two inputs of the logic gates 0U2 
and 003. The logic gate 0U2 receives on its other input the 
inverted signal bloc. The logic, gate 00 3 receives on its 
other input the signal bloc. The output of the logic gate 
001 is connected to the validation input enl of the decoder 
DECl, and the output of digit position i of this decoder DECl 
activates a validation buffer BVL of digit position i. The 
output of the logic gate 002 is connected to the validation 
input enO of the decoder DECO and to the validation buffers 
BVE. The output of the logic gate 0U3 is connected to the 
logic gates ET1 0 . . .ETl 15 , which receive on their other 
input the output of the corresponding logic . position of the 
decoder DECO. The output i of each logic gate ETl Q ... 
ET1 15 is connected to the writing inpout w 0 ...w 15 of 
each memory module MC Q ...MC 15 . A data bus connects each 
memory module MC Q ...MC 15 to one of the validation buff ers 
BVL and to one of the validation buffers BVE. The output of 
the buffers RVL and the input of the buffers RVE receive in 
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parallel a data bus datamot_MC (connected to the request 
management processor PGU) • 

The operation of the example of the cache memory 
described above is as follows: 
Case 1 

The request comes from the management processor of the 
series link PGS. This case- is signalled by the presence of a 
lo<gic state zero on the signal bloc. 

In reading the cache memory, the management processor 
of the series link PGS presents on the address bus adr_MC the 
address of placement of the block to be read (in this case, 
only the part adr_bloc of the bus adr_MC is used) and activ- 
ates the reading signal* f_MC. At the end of the access time, 
the block is available on the bus databloc_MC. 

In writing the cache memory, the management processor 
of the series link presents on the address bus adrJMC the 
address of placement of the block to write, on the data bus 
databloc_MC the data to be written there, and activates the 
line w_MC. The zero state of the signal bloc directs the 
signal w_MC toward the writing control inputs of the cache 
memory modules MCq...MC2 5 , via the logic gates CO 3 and 
ET1. The information presented on the data bus datablocjMC 
is written in the cache memory at the end of the writing. 
Case 2 

The request come from the request management processor 
PGU of the treatment processor CPU . This case is signaled by 
the presence of a logic state one on the signal bloc 

In the reading cache memory, the management - presents 
on the bus adrJMC the address of the word requested, and acti- 
vates the reading signal ?J4C. The block corresponding to 
the part adr bloc is read in cache memory, and the word 
requested is directed, via one of the validation buffers BVL, 
toward the data bus datamotJMC. The validation buffer BVL 
concerned is activated by the output of the decoder DECl 
corresponding to the word address adr_mot requested. 
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In writing the cache memory, the management processor 
PGU presents on the bus adr_MC the address of the word to be 
written, on the data bus datamotJIC the data to be written, 
and activates the writing signal w_MC. The data presented on 
the bus datamot_MC is distributed on each cache memory mod- 
ule, via the buffers BVE validated by the writing signal. 
The writing signal w^MC is then presented to only the memory 
module concerned. It is delivered to the output of the decod- 
er DECO corresponding to the address adr_mot concerned. 

In the embodiment described above, the problems of 
access in an octet and double octet, and of access in double 
octet and a word on both sides of two memory modules are 
resolved in the same manner as in conventional information 
systems and are not described here. 

FIGURES 12a, 12b, 12c and 12d show, by way of example, 
the dharacteristics of a management directory of the cache RD 
and of an associated management processor PGR. FIGURE 12a 
illustrates the logic structure of the address adr_RG in the 
hypothesis of a space addressing on 32 bits and with the char- 
acteristics of the cache memory described previously. The 
field -tag-, composing the address block, is coded on 18 
bits. The field -cadre- is coded on 8 bits and permits 
addressing the 256 positions of the block of the cache memory 
MC. The last 6 bits define the address word in the block, in 

an octet unit. 

FIGURE 12b shows the structure of the management 
directory of the cache RG, which is a simple fast read/write 
memory of 256 words' of 22 bits. Each word of the address i 
contains the descriptor of the block written. in the position 
i of the cache memory . 

FIGURE 12c shows schematically the structure of the 

descriptor which comprises: 

a field tag of 18 bits, defining the address of the 
block in the position or frame of the current block, 

the validation bit v, 
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the modification bit m, 

the wait-for-end-of -transfer bit a, 

the* wait-for-purge bit f. 

FIGURE 12d provides the structure of the processor 
PGR f which is nothing other than a conventional arbitrator 
with fixed priority. 

This arbitrator "comprises a register LATCH , of which 
three inputs .receive respectively the signals rqst_UR, 
rqst_PR, rqst_SR , also delivered respectively to the logic 
gates ET2 , ET3 r ET4 . The corresponding outputs of the regis- 
ter LATCH are connected to the inputs of a priority encoder 
PRI, the outputs of which are connected to the inputs of a 
decoder DECPRI. The outputs of the digit position correspond- 
ing to those of the register LATCH are connected to the sig- 
nals grnt_OR, grnt_PR, grnt_SR as well as , in an inverted 
manner, respectively to the inputs of the logic gates ET2, 
ET3, ET4. The outputs of the logic gates ET2, ET3, ET4 are 
connected to the inputs of the logic gate N001. The output 
of the logic gate N0U1 is connected to a flip flop Bl, which 
receives on its input D the output eO of the priority encoder 
PRI. The assembly of the apparatus is synchronized by a gen- 
eral clock which delivers a signal h to one of the inputs elk 
of a logic gate ET5, and in a reverse manner , to the clock 
input of the flip flop Bl. . The output Q of the flip flop Bl 
is connected to the other input of the logic gate ET5. The 
output of the logic gate ET5 is connected to the input load 
of the register LATCH • 

The operation of this "arbitrator is as follows: in 
the absence of any request on the lines rqst , the flip flop 
Bl stores permanently the state of the line eO, inactivates, 
and thus validates through the logic gate ET5 the loading of 
the register LATCH. 

The arrival of a signal rqst provokes the latching of 
the clock and the activation of the signal grnt associated 
with the signal rqst, until deactivation of- the latter: the 



39 



arbitrator is fixed in its state during the entire transac- 
tion in progress. 

The request management processor PGU, represented in 
FIGURE 13 , constitutes an interface between the treatment 
processor CPU and: 

the various processors with which it must exchange 
information: paralllel "management processor PGP, series 
management processor PGS f directory management processor PGR, 
and 

the management directory of the cache memory RG and 
the cache memory MC. 

The treatment processor CPU unlatches the activity of 
the request management processor PGU while activating the sig- 
nal as ("address strobe 11 ). This signal validates the address 
bus adr_CPU, the reading signals r_CPU and the writing sig- 
nals w_CPU as well as the function lines fc_CPU of the treat- 
ment processor CPU. The treatment processor CPU is then 
placed in a waiting state until discharge of the request by 
the signal dtack_CPU . 

The signal as is connected to the input of a different- 
iator circuit DIO. The output of this circuit is connected 
to one of the three inputs of a logic gate ET12, the two 
other inputs receive respectively the signals ack_US and 
ack_UP . This latter signal is also delivered to the input R 
of a flip flop B13. The input S* of the flip flop B13 
receives the output of a logic gate NET10 . The output of the 
logic gate ET12 is connected to the input S of a flip flop 
Bll, at the input S of the flip flop BIO and at the input 
clearlO of a shift register SR10 . The output Q of the flip 
flop Bll furnishes the signal rqst_UR. The flip flop Bll 
receives on its input R the inverted phase 613 and the flip 
flop BIO the inverted phase 811. The output Q of the flip 
flop BIO is connected to the series input serial_inlO of the 
register SR10. The shift register SR10 receives on its input 
clklO the clock signal -h- and on its validation input enlO 
the signal grnt_UR. 
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The activation of the signal as releases the operation 
of the differentiation circuit DlO. The pulse produced by 
this circuit across the logic gate ET12 , sets to the logic 
state one of the flip flops BIO and Bll by their input S, and 
also carries out the resetting to zero of the shift register 
SR10 by its input clear 10 . 

The flip flop BIO antf the shift register SR10 consti- 
tute the logical sub-assembly "phase distributor" DP_U. The 
activation of this phase distributor is released by the sett- 
ing to one of the flip flop BIO and the resetting to zero of 
the shift register SR10 . If the shift register is .validated 
by the presence of a level zero on its input enlO , then the 
next pulse of the clock h on the input elk of the shift regis- 
ter produces the shifting of one step of said register. 

The logic state one of the flip flop BIO is propagated 
on the output 911 of the shift register SR10 by its series 
input serialinlO . The output Qll, called phase 011, invert- 
ed, resets the flip flop BIO to zero through its input R. 
Thus, one single bit is introduced to the shift register SR10 
on each activation of the phase distributor DPJD. At each 
clock pulse h, this bit comes to be shifted in the shift 
register SR10 and produce the consecutive disjointed phases 
911, 912, 913. 

The setting of the flip flop Bll to the logic state of 
one causes activation of the signal rqst_OR. This signal is 
emitted to the destination of the directory management proces- 
sor PGR. This latter, as soon as possible, comes to accord 
access to the management directory RG and to the cache memory 
MC while activating the signal grnt_UR, which comes to valid- 
ate the assembly of passage buffers BV10 , BVll and BV12 situa- 
ted respectively on the busses of the management directory 
and on the busses of the cache memory. This signal grntJUR 
also validates the phase distributor which then produces 
sequentially the phases 811, 912, 913. 
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The phase 811 corresponds to a time-out permitting 
reading of the descriptor of the block requested by the treat- 
ment processor CPU in the management directory RG, addressed 
by the field frame with the address adr_CPU and connects to 
the bus adr_RG through the passage buffers BV10 . The signal 
r_RG is always active on the input of a buffer BVlO , the sig- 
nal wJRG always inactive on the input of a buffer BVlO . At 
the ' start of the time out, the descriptor is returned to the 
processor PGU via the bus date_RG. The part tag of this 
descriptor and the validation bit v are delivered to one of 
the comparison inputs of a comparator COMP10, the other input 
being connected to the part tag of the address adr^CPO . The 
bit opposite the validation bit v is always at one. The com- 
parator COMP10 is validated permanently by the presence of a 
level of one on its input enll. 

The time of access of the management directory RG and 
the frequency of the clock h are in a ratio such that at the 
end of the phase 911, the output eg 10 of the comparator 
COMP10 is positibnned and furnished the information "the 
block requested is present in the cache memory or absent from 
the cache memory 

If the block requested is present in the cache memory 
MC (eglO » 1) , then the signal eglO, delivered to one of the 
two inputs of the logic gate ET10 , furnishes a signal calib- 
rated by the phase 812 , present on the outher input of the 
logic gate ET10 . 

This calibrated signal present on the output of the 
logic gate ET10 is connected to the inputs of the logic gates 
NET10 , NET 11 , NET12. 

The logic gate NET10 receives on its inputs, in 
addition to the output of- the logic gate ET1Q, the inverted 
status bit m coming from the descriptor, arid the inverted 
writing request signal w_CPU. 

The activation of the logic gate NET10 corresponds to 
the state "request writing of a word in a block present in 
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the cache and which has not yet been modified (m = 0)". The 
output of the logic gate NET10 is connected to the input S of 
a flip flop B13. The activation of the logic gate NET10 sets 
the flip flop B13 in the logic state of one, which releases 
an informative writing request by the line rqst_UP to the 
management processor of the parallel link PGP. The address 
of the block concerned is furnished by the lines adr_blocJJP, 
derived from the lines adr_CPU. 

The request management processor PGU has terminated 
the first part of its task: the management directory RG and 
the cache memory are freed by deactivation of the signal 
rqst_UR r as a consequence of the arrival of the inverted 
phase 813 on the input R of the flip flop Bll. 

The informative writing mechanism is described in the 
paragraph "management processor of the parallel link PGP", 
and has for effect to place the requested block in the modi- 
fied state (m = 1) or the invalid state (v 0). One will 
note that the liberation of the management directory RG and 
the cache memory MC by .the request management processor PGU 
is necessary in order that the management processor of the 
parallel link PGP may have access thereto. The end o£ the 
"informative writing" operation is signalled to the request 
management processor PGU by the activation of the signal 
ackJJP, which has the effect of resetting the flip flop B13 
to zero and activating, through the logic gate ET12, the flip 
flop Bl2 r the flip flop Bll and the phase distributor: the 
cycle initiall activated by the signal as is reproduced, but 
the consequential sequence of the activation of the gate 
NET 10 is not reproduced a second time in this request cycle. 

The logic gate NETll receives on its inputs, other 
than the output of the logic gate ET10 , the status bit m from 
- the descriptor, and the inverted signal of the . writing 
request w_CPU. The activation of the gate NETll corresponds 
to the state "writing request in a block present ia the cache 
and which has already been modified. The output of the gate 
NETll is connected, through one of- the buf f ers * BVll , . to the 
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writing signal wJMC of the cache memory MC. This signal 
permits writing in the cache memory , at the address present 
on the bus adr_MC, connected to the bus adr_CPU, via the 
buffers BVll, the data present on ths bus data_MC, connected 
to the bus data_CPU via the bidirectional buffers BV12 . The 
direction of activation of these buffers is furnished by the 
signal w_MC. ■ 

The output of the gate NET11 is also connected to one 
of the inputs of the logic gate ET11, which thus re-^sends the 
signal dtack_CPU to the treatment processor CPU. The writing 
operation in the cache is done in parallel with the activa- 
tion of the signal dtack_CPU , which is conformed to the usual 
specifications of treatment processors. 

The operation is terminated by freeing of the manage- 
ment directory RG and of the cache memory MC by deactivation 
of the signal rqst_UR, as a consequence of the arrival of the 
inverted phase &13 on the flip flop Bll. 

The logic gate NET12 receives on its inputs, in addi- 
tion to the output of the . logic gate ET10, the inverted read- 
ing request signal r_CPU. The activation of the logit gate 
NET 12 corresponds to the state "request reading of a word in 
a block present in the cache." 

The continuation of the operations is identical to the 
preceding operation, with the sole difference of the activat- 
ed signal <r_MC rather than w_MC) associated with the direc- 
tion of transit of the data on the busses data_CPU and 
data_MC . 

If the block requested is absent from the cache ..memore 
(eglO « 0) , then the signal eglO, inverted, connected to one 
of the two inputs of the logic gate NET13, furnishes a signal 
calibrated by the phase 912, present on the other , input of 
the logic gate NET13. The output of the logic gate NET13 is 
connected to the input S of the flip flop B12. This calibrat- 
ed signal forces the flip flop to one, which has the effect 
of sending a service request rqst_US to the management proces- 
sor of the series link PGS. This processor also" receives the 
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address of the block to request on the lines adr_bloc_US and 
the nature of the request on the lines w_US, r_US and fc_US. 

The request management processor PGU has terminated 
the first part of its task: the management directory RG and 
the cache memory MC are freed by deactivation of the line 
rqst_UR, as a consequence of the arrival of the inverted 
phase 613 on the flip flop Bll. 

The mechanism" for updating the cache is described in 
the paragraph "management processor for the series link PGS". 

One will note that the liberation of the management 
directory RG and the cache memory MC is necessary in order 
that the management processor for the series link PGS may 
have access thereto. 

The updating of the cache is signalled to the request 
processor PGU by the activation fcf the signal aclcJJS. This 
signal is delivered to the input R of the flip flop B12 and 
to the input of the gate ET12 . It thus has the effect of re- 
setting to zero the flip flop Bl2 and activating through the 
logic gate ET12, the flip flop Bll and the phase distributor: 
the cycle initially started by the signal as is reproduced, 
but this time with success due to the presence of the block 
in the cache . memory • 

The series management, processor PGS shown by way of 
example in FIGURE 14 is charged with managing the series link 
LS and in this way to provide the block transfer requests 
between the central memory RAM and the cache memory MC and to 
produce the corresponding operations in the management 
directory RG. It treats in priority the requests from the 
snooper processor PE and waits in a waiting line FIFO, It 
also treats the requests from the request processor PGU. 

This management processor for the series link PGS 
comprises a flip flop B20 which receives on its data inputs D 
the signal empty coming from the waiting file FIFO and on its 
clock inputs the output Q of a flip flop B22. A. flip flop 
B21 receives on its input the output data of a logic gate 
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OU20. This logic gate OU20 validates the signal rqst_us, 
received on one of its two inputs, the other input being 
connected to the signal empty : The clock input of the flip 
flop B21 coming from the output Q of the flip flop B22 . The 
output Q of the flip flop B22 is reconnected on its data 
input D, which conditions it while dividing by two. The 
clock input of the flip flop B22 is connected to the output 
Of a logic gate ET20, which receives on one of its inputs the 
general operating clock h and on the other input a validation 
signal. This validation signal comes from the output of a 
logic gate ET24, which receives respectively on- its two 
inputs the outputs <5 and Q of the flip flops B20 and B21. 

The flip flop B20 receives on its input R the phase 
825, inverted, issuing from a phase distributor DP_S. the 
flip flop B21 receives on its input S the output of a logic 
gate NOU20. This logic gate NOU20 receives on its two inputs 
the respective outputs of the logic gates ET22 and ET23. The 
logic gate ET22 receives on its inputs the phase 025 from the 
phase distributor and the signal maj from a logic gate ET29. 
The logic gate ET23 receives oh its inputs the phase 627 from 
the phase distributor and the inverted signal maj. 

The assembly of the circuits B20, B21, B22 , ET20 , 
ET22, ET23, OU20 , ET24, NOU20 ' constitute a fixed priority 
arbitrator ARB_S. Its operation is as follows: the flip 
flop B22 furnishes on its outputs Q and Q the alternate freq- 
uency signals half of that of the general clock. These sig- 
nals validate alternatively the flip flops B20 and B21. If a 
request for service is present on one of the. inputs of the 
flip flops B20 or B21, then these alternated signals store 
the request in the corresponding flip flop (B20 for the snoop- 
er processor PE, B21 for the request management processor 
PGU) which, in return, lock out the alternated function. One 
will note that the signal rqst_US originating from the 
request management processor PGU is conditionned by the sig- 
nal empty through the gate 0020: the signal is only 
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considered if the waiting line FIFO is empty. The flip flop 
B20 (respectively B21) is only reset to zero when the transac- 
tion to be carried out is terminated. 

The sampling of a request on one or the other of the 
flip flops B20 and B21 translates by a change of state of the 
output of the logic gate ET24. The output of the logic gate 
ET24 is also connected to a differentiator circuit D20 which 
delivers a pulse upon changing of the state of the output of 
the logic gate ET24. The output of the differentiator cir- 
cuit is connected , on one part* to the phase distributor DP_S 
(input S of flip flop B36 and clr20 of a shift register SR20) 
of the management processor for the series link, and for the 
other part to one of the two inputs of the logic gate 0022. 
The output of the logic gate is connected to the inputs S of 
the two flip flops B24 and B25 . The flip flop B24 receives 
on its input R the output of a logic gate N0U21 and the flip 
flop B25 the signal grnt_SR. The logic gate N0021 receives 
on its two inputs the outputs of the logic gates ET36 and 
ET37. The logic gate E36 receives on its inputs the phase 
02 3 from the phase distributor and the signal maj, and the 
logic gate E37 the phase &27 from the phase distributor and 
the signal maj inverted. 

The pulse from the differentiator circuit D20 thus 
initializes the phase distributor and sets the logic state of 
the flip flops B24 and B25 to one across the logic gate OU22. 

The phase distributor DP_S is composed of shift regist- 
er SR20 and the flip. flop B36. Its operation is identical to 
that described in the paragraph concerning the request manage- 
ment processor PGU. 

The output Q of the flip flop B24 is connected to the 
signal rqstjSR, to the destination of the directory manage- 
ment processor PGR. Its activation releases a request for 
service to this processor , which responds by the line 
grnt_SR, connected to the input ll of the flip flop' B25. The 
output Q of the .flip flop B25 is connected to one of the 
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inputs of a logic gate OU23. the output of thje logic gate 
OU2 3 is connected to the input en20 of the shift register 
SR20. 

The logic assembly B24 and B25 constitute a resynchron- 
ization logic RESYNC_S between asynchronous units. Its opera- 
tion is as follows: 

A service request rqst_SR to the directory management 
processor PGR is made by activation of the flip flops B24 and 
B25 through the logic gate OU22, which authorizes two origins 
of activation. The logic proper to the directory management 
processor PGR assures a response grnt_SR in an indeterminate 
time, which resets the flip flop B25 to zero. The flip flop 
B24 maintains its request until it is reset to zero by activa- 
tion of its input ll. In return, the directory management 
processor PGR deactivates its line grnt_SR: the resynchroni- 
zation logic is ready to operate for a next request for serv- 
ice. The output Q of the flip flop B25 serves to block the 
phase distributor by action on its input en20 via the logic 
gate OU23: the resetting to zero of the flip flop B25 frees 
the phase distributor which furnishes the first phase 921 
upon the next active transition of the general clock h, 
connected to the input clk20 of the phase distributor. 

The line grnt_SR is also connected to" the validation 
buffers BV20, BV25 and BV21 r BV22 which open access respect- 
ively to the management directory RG and to the cache memory 
MC. 

If the flip flop B20 is active, then the transaction 
in progress- is a purge of the block requested .by the snooper 
processor PE via the waiting line FIFO. The .output Q of th 
flip flop B20 is connected to the validation buffers BV24. 
These buffers receive on their input the output of a register 
REG20. The output Q of the flip flop B20 is connected to one 
of the two inputs of a logic gate 0U21, which receives on its 
other input the output of the differentiator circuit D20. 
The output of the logic gate 0021 is connected to the input 
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Ioad20 of the register REG20 and to the input read20 of the 
waiting line FIFO. 

Thus, the activation of the flip* flop B20 provokes: 

1. Initialization of the phase distributor, 

2. A request for access to the management 
directory RG and to the cache memory MC, 

3. The loading' of the element at the head of the 
waiting line FIFO in the register REG 20 and advancing of the 
waiting ling, 

4. The validation of the buffer BV2 4: the bus 
adr_x contains the address of the block to be purged. The 
nature of the operation (carried out) will be relocated from 
the combination of the bits v and m (signal maj). 

If the flip flop B21 is active, then the transaction 
in progress comes from an information defectin the cache 
memory MC. The output Q of the flip flop B21 is connected to 
the validation buvvers BV23. These buffers receive on their 
input the information originating from the request management 
processor PGU. 

Thus, the activation of the flip flop B21 provokes: 

1. The initialization of the phase distributor, 

2. A request for access to the management 
directory RG and to the cache memory MC, 

3. The validation of the buffers BV23: the bus 
adr_x contains the address of. the block which has provoked 
the information defectin the cache memory MC, and the nature 
of the request: ' -reading or writing, shared data of 
non-shared data (lines fc_U8) ♦ 

The field "frame" 6f the bus adr_x is .connected to the 
address lines adr_RG of the management directory through the 
validation buffers BV20. the outputs Q of the flip flop B26 
and B27 are respectively connected to the lines 7_RG and w_RG 
of the management directory through the validation . buffers 
BV20. The flip flop B26 receives on its input S the output 
of the logic gate 0022 and on its input R the phase 022 
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inverted, coming from the phase distributor. The flip flop 
B27 receives on its input S the output of a logic gate NOU22 
and on input R the output of a loghic gate NOU23. The logic 
gate NOU22 receives on its two inputs the respective outputs 
of the logic gates ET25 and ET26, themselves receiving on 
their inputs, for the gate ET25 the phase 822 and the signal 
maj, and for the gate ET26 the phase 926 and the signal ma j . 
The logic gate NOU23 receives on its two inputs the respec- 
tive outputs of the logic gates ET27 and ET28 , themeselves 
receiving on their inputs, for the gate ET27 the phase 823 
and the signalmaj inverted, and for the gate ET28 the phase 
827 and the signal maj inverted. 

The field tag of the bus adr^x is connected to the 
field tag of the lines data_RG through the validation buffers 
BV25. These latter receive on their validation input the 
output of a logic gate OU24 which receives on its inputs the 
signal grnt_SR and the output Q from the flip flop B27 . 

The inputs D of the flip flops B28 and B29 are respect- 
ively connected to the bit validation lines v and bit modifi- 
cation lines m of the bus data_RG. The clock inputs of these 
flip flops B28 and B29 are connected to the phase 022 of the 
phase distributor. The output Q of the flip flop B28 and the 
output Q of the flip flop B29 are connected to two inputs of 
a logic gate ET29 , which furnishes on its output the signal 
maj. A logic gate OU25 receives on its inputs the signal maj 
and the output Q of a flip flop B30. The output of the logic 
gate OU25 is connected to the selection input sel20 of a 
multiplexer MUX20,- which receives on its two data inputs the 
output Q of the flip flop B28 (bit v) and the. constant 0, the 
constant zero being selected when the selection input sel20 
is at the logic state of one. The output of the multiplexer 
MUX20 is connected to the bit validation line v of the bus 
adr_x. The bit waiting line -a- of the bus adr_x is forced 
to the logic state zero. The modification bit -m- is connect- 
ed to the reading line r_adr_x of the bus adr_x. 
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The logic assembly described above constitutes the 
access logic and operation logic of the management directory 
RG - operation is as follows: the activation of the 

signal grnt_SR, authorizing access to the management director 
and to the cache memory, validates the validation buffers 
BV20. The reading of the descriptor concerned is controlled 
from the beginning of the access authorization until the 
arrival of the phase 922 , the storage instant of the bits -v- 
and -m- in the flip flops B28 and B29 k The combined state Of 
these two bits produces, across the logic gate ET29 , the 
signal maj which conditions the succeeding operations. 

First Case: • maj = 1. This case is produced when 

the validation bit is equal to 1 and the modification bit is 
equal to 1. This case corresponds either to a request to 
purge the block from the snooper processor PE, or to an 
information defect established by the request management 
processor PGO upon placement of the block occupied and 
modified: in the two cases, the block concerned must be 
written into the central memory RAM. 

To this effect, the field -cadre- of the bus adr_x is 
connected to the lines adrJ4C via the validation buffers 
BV21. The outputs Q of the flip flops B30 and B31 are respec- 
tively connected to the lines r_MC and w_MC of the cache mem- 
ory MC through the validation buffers BV21. The line bloc is 
forced to zero via one of the. validation buffers BV21. The 
data lines data_MC of the cache memory are connected, via the 
bidirectional validation buffers BV22 to the inputs of the 
shift register RDP and to the outputs of the validation buf- 
fers BV2 6, which receive on their inputs the. output lines of 
the shif t reg ister RDP. The buffers BV22 are validated by 
the line grnt_SR, and their direction of validation controll- 
ed by the output Q of the flip flop B31. The flip flop B30 
receives on its inputs S and R respectively the phases Q21 
and Q23, inverted, coming from the phase distributor. the 
flip flop B31 receives on its inputs S and- R respectively the 
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outputs of the logic gates ET32 and ET33. The logic gate 
ET32 receives on its inputs the phase 926 and the signal maj 
inverted f the logic gate ET33 the phase 827 and the signal 
maj inverted. A logic gate NET 20 receives on its inputs the 
phase 82 3 and the signal maj. The output of . a logic gate 
ET3S controls the validation buffers BV26, and receives on 
its two inputs respectively the signal maj inverted and the 
output Q of the flip flop Q31. The output of the logic gate 
NET 20 is connected to the signal load21 of the shift register 
RDP and to the input S of a flip flop B32. The input R of 
this flip flop B32 receives the signal f in_transf ertjaaj com- 
ing from the logic TPR associated with the shift register 
RDP • The output Q of the flip flop B32 is connected to one 
of the inputs of the logic gate QU23. 

The logic described above permits purging a block from 
the cache memory. Its operation is as follows: 

In parallel with the access to the management direc- 
tory RG , a reading of the cache memory MC is activated by the 
line r_MC, coming from the flip flop B30, during the phases 
Q21 and 622. At the output of this reading , the data read, 
representing the block to be downloaded , are presented at' the 
input of the shift register RDM. The activation of the sig- 
nal maj, the state of which is known from the beginning of 
the phase 822, provokes: 

1. The invalidation of the block in the manage- 
ment directory: the input sel20 of the multiplexer MOX20 
being at the logic .state one, the value zero is forced on the 
validation bit, the descriptor being written in the manage- 
ment directory RG with the activation of the. signal w_RG, 
controlled by the flip flop B27 during the cycle Q22 , 

2.. The loading of the shift register RDH and the 
activation of the transfer, upon passage of the phase 922 to 
the phase 623, 

3 . The placement at the logic state one of the 
flip flop B32, which comes to block the phase distributor on 
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the Pha se 823 until the end of the transfer, signalled by the 
signal f in_transf ert_maj coming from the logic TFR, 

4. The liberation of the management directory RG 
and of the cache memory MC by resetting to zero the flip flop 
B24 on the phase 823. 

Thus, the access to the directory if freed (therefor 
accessible for the snooper processor PE) , the transfer of the 
operation is in progress, and the phase distributor blocked 
on Q23. 

When the transfer is terminated, the block is in 
waiting state in the shift register RDM and it is necessary 
to activate the management processor of the parallel link in 
order that the writing into central memory RAM will be 
effective . 

To this effect, the flip flop B33 is connected by its 
output Q to the service request line rqsl_SP at the destina- 
tion of the parallel link management processor PGP. The out- 
put Q is connected to one of the inputs of the logic gate 
OU23, the input S to the phase 624 inverted from the phase 
distributor and the input IT to the signal ack_SP. The bus 
adr_x is connected to the bus adr_bloc_SP of the parallel 
link management processor PGP. One of the lines of the bus 
adrJ>lbc_SP receives the signal maj in order to indicate the 
nature of the requests position open. 

After the liberat ing of. the phase distributor by the 
signal f m_transf ertmaj , the next active transition of the 
general clock h causes the passage of the phase 823 to the 
phase 824. The phase 824 provokes a service request from the 
parallel link management processor PGP (activation of the 
line rqst__SP) and the blocking of the phase distributor until 
discharge of the request by the signal ack^SP. At this 
moment, the writing in operation will be effectively achieved 
by the parallel link management processor PGP. The phase 
distributor, on the next active transition of the clock h, 
comes to pass from the phase 824 to the phase 825. 
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The updating of the central memory RAM is terminated: 
the flip flop B20 is set to zero by activation of its input R 
by the phase 025 inverted. The flip flop B212 is set to one 
by activation of its input S by the phase 025, conditionned 
by the signal maj by means of the logic gate ET22 via the 
logic gate NDU22. In case of information defect in the cache 
memory, the purge of a block only constitutes the first part 
of the request: the request rqst_CJS is always present, but 
the liberation of the flip flop B21 is going to permit event- 
ual consideration of requests for operation while waiting in 
the waiting line FIFO. When the waiting line FIFO- is empty 
(empty ■ 0), the whole cycle, described previously is reproduc- 
ed, with, this time, the validation bit at zero." On is then 
in the following case: 

Second Case: maj = 0. This case arises when the valida- 
tion bit is equal to zero (by following perhaps, a purge of a 
block) or when the Validation bit is equal to one, but the 
modification bit is equal to zero: the current copy of this 
block is already in the central memory RAM. 

Thus, the request for service rqst_ SR comes to cause 
in return an accord for access to the management directory RG 
and to the cache memory MC with reading of the descriptor, 
storing the bits m and v, generation of the signal maj, re- 
writing of the descriptor with v * 0 (maj is at the logic 
state zero, the flip flop B31 is at the logic state zero and 
its output Q has a value of one, which imposes a logic signal 
at the state one oh the input sel20 of the multiplexer MOX20 
and therefor forces the constant 0) and the freeing of access 
to the management directory RG and the cache .memory MC. The 
operations are effective upon activation of the phase 02 3. 

The block request must now be read in the central mem- 
ory RAM. To this effect, the signal maj inverted is received 
at one of the inputs of a logic gate NET21 , which receives on 
its other input the phase 025. The output of the logic gate 
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NET 21 is connected to inputs s" of the flip flops B34 and B35. 
The flip fl op B34 receives on its input R the signal 
rin_reception coming from the transfer logic TFR. Its output 
Q is connected to the differentiator circuit D21. This cir- 
cuit is connected to the logic gate 0022. the input R of the 
flip flop B35 is connected to the signal grn\_SR and the out- 
put 5 on one of the inputs of the logic gate 0023. 

The reading of a block in the central memory RAM and 
its transfer into the cache memory are realized in the follow- 
ing manner: the transition of the phase 923 to the phase 92 4 
unlatches a service request to the parallel link management 
processor by activation of the line rqst_SP, coming from the 
flip flop B33. The type of operation is this time a reading 
<r_adr_x - 0) or a writing (w_adr_x = 0) and the bus adr_x 
furnishes the address of the block requested on the bus 
adr_bloc_SP. The phase distributor is blocked until arrival 
of the release signal aclc_SP: the reading or writing request 
has been made to the central memory RAM by the parallel con- 
nection management processor PGP, and the block has been at 
the same time validated and marked "in waiting" by this, same 
processor. the transfer is then in progress, from the cent- 
ral memory RAM to the shift register RDP. 

The phase distributor, freed, furnishes then the phase 
625. This phase, by the intermediary of the logic gate NET21 
which receives on its other input the signal maj inverted, 
comes to set to the logic state one, the flip flops B34 and 
B35. The flip f lop • B35 blocks the phase distributor. The 
flip flop B34 is reset to zero upon arrival of the block 
request i n the registe r RDP, signalled by the activation of 
the line f in_reception coming from the transfer logic TFR, 
which releases the activation of the differentiator circuit 
D21 and an access demand to the management directory RG and 
the cache memory MC, by the activation of the line rqit_SR 
from the flip flop B24. 
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The access accord, signalled by the line grnt_SR, 
frees the phase distributor by reseting the flip flops B25 
and B35 to zero, and opening the access to the management dir- 
ectory RG and to the cache memory MC. The phase distributor, 
upon the next active transition of the general clock h, furn- 
ishes the phase 026. 

The flip flop B27, connected to the signal wJRC of the 
management directory, is activated by Q26 and Q27 , which per- 
mits providing the operation of the management directory 
with: 

field tag of the bus dataJRG « field tag of the 

bus adr_x, 

validation bit v « 1 (maj » 0 and the output Q of 
the flip flop B31 is at zero: the multiplexer MUX20 allows v 
to pass, which has been forced to one by the parallel connec- 
tion management processor PGP, or already reset to zero by 
the snooper, processor PE) , 

modification bit m « state of the line r_adr_x of 
the bus adr_x (writing request causes m = 1, reading request 
m ■ 0) , 

awaiting transfer bit a » 0, 
bit f * 0. 

The flip flop B31 activates the signal w_MC of the 
cache memory MC of Q26 and Q27, which permits writing trhe 
contents of the shift register RDM (the buffers BV26 are 
validated by the output Q of the flip flop B31 and the signal 
maj inverted) in the good placement (field -cadre- of the bus 
adr_x connected to the bus adr_MC) of the cache memory, via 
the validation buffers BV22, controlled in the direction of 
tralsfer by the flip flop B31. 

Upon arrival of the phase 627, the operation of the 
management directory and of the cache memory* is terminated. 
The arrival of the phase 027 provokes the resetting of the 
flip flop B24 to zero, which frees the access to the manage- 
ment directory RG and to the cache memory MC and the 



56 



activation of the output of the logic gate ET23, resetting 
the flip flop B21 to zero and activating the signal ack_OS to 
the destination of the request management processor PGU: the* 
request is terminated. 

in addition , the snooper processor PE is charged with 
maintaining the coherence of the data in the cache memory MC. 
By way of example, there is * shown in FIGURE . 15 an architec- 
ture of this processor. This is released, at each active 
transition of the signal valid from the common bus BUSA. 

If the signal valid is activated by a parallel connec- 
tion management processor PGP other that that associated with 
the snooper processor PE # then the task of this latter is as 
follows, as a function of the request: 

request for reading of the block of non-shared data, 
or request for writing of the operation of the block: 
nothing, 

request for reading of the block of shared data: 
block absent: nothing, 

block present and not modified: nothing, 
block present and modified: request for purge of 
the block (that this block be present or in the course of 
transfer from the central memory RAM to the cache memory MC, 
state signalled by the bit -s-'of the descriptor), 

request for writing of the block of shared data: 
block absent: nothing, 

block present unmodified: invalidate the block, 
block present modified: request for purge of the 
block (same remark as above), 

request for informative writing of the block: 
block absent: nothing, 

block present unmodified: invalidate the block, 
block present modified: impossible case, 
if a request for informative writing is in wait- 
ing on this same block, it is transformed into a request for 
writing of the block, since this block has been invalidated: 
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to this effect, the request in progress is annulled and a 
release is sent to the request management processor PGU. 
This latter consults the directory RG and comes to find the 
block absent: a request is sent to the management processor 
PGS. This operation is taken into consideration by the para- 
llel management processor PGP. 

If the signal valid 'is activated by the parallel con- 
nection management processor PGP associated with the snooper 
processor, then the task of the latter is as follows, func- 
tion of the type of request (termed local request): 

request for reading of the block of non- shared 
data or request for writing of the operation of this block: 
nothing, 

request for reading of the block of. shared data: 
mark the block valid, while waiting for transfer and modified 
( v = m = s = 1 ) , 

request for informative writing of the block: 
mark the block "modified*' (m =1). 

To assure the functions described above, the snooper 
processor PE comprises a phasae distributor DP_E, comprising 
a shift register SR40 and a flip flop B40. The output of a 
differentiator circuit D40 is connected to the input S of the 
flip flop B40 and the input clear 40 of the register SR40, as 
well as the input S of a flip flop B.41. The output Q of the 
flip flop B40 is connected to. the input serial_in40 of the 
register SR40 . The input clk40 of the register SR40 receives 
the signal h from the general clock and the validation input 
en45* is connected to the output Q of the flip. flop B43. The 
phase B41, inverted, coming from the register SR40 is connect r 
ed to the input R of the flip flop B40. 

The operation - of the phase distributor is in accord- 
ance with the description given for the" request management 
processor PGU. 

The signal valid of the bus BOSA is connected to the 
input of -the differentiator circuit D40 and to the validation 
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input en41 of a decoder DEC 41. The flip flop R41 receives on 
its input R the output of the logic gate NOU40. The output Q 
of the flip flop B41, to. the open collector, is connected to 
the signal done of the bus BUSA. 

The signals valid and done assure the synchronization 
of the snooper processor PE with the other snooper processors 
of the multiprocessor system: the negative transition of the 
signal valid unlatches the differentiator circuit D40 which 
produces a pulse permitting activation of the phase distribut- 
or and sets the signal done at the logic state zero through 
the intermediate flip flop B41. The end of the wo.rk of the 
snooper is signalled by a change of state on the output of 
the logic gate NOU40, which produces the setting of the logic 
state of the signal done to one through the intermediary of 
the flip flop B 41. 

The work of the snooper is a function of the nature of 
the request and, to this effect, the field type of the bus 
BOSA is connected to the input of the decoder DEC 41 . The out- 
puts dnp and dmaj of the decoder DEC 41 are connected to the 
inputs of a logic gate OU40. The outputs di , de , dei of the 
decoder DEC41 are connected to the inputs of a logic gate 
0041, the outputs de and dei being both connected to the 
inputs ofr a logic gate OU42. The output of the logic gate 
OU40 is connected to one of the inputs of the logic gate 
NOU40 by the intermediary of a logic gate ET40, which 
receives on its other input the signal Q41. The output of 
the logic gate 0D41 is connected respectively to one of the 
inputs of the logic gates ET42, ET43, which receive on their 
other input respectively the phase signal Q41 and 944. 

The outputs of the logic gates ET42 and ET43 are con- 
nected respectively to the inputs sf and R of a flip flop B44. 
The output of the logic gate ET42 is also 'connected to the 
inputs S of the flip flops B42 and B43. A logic gate N0U41 
receives on its inputs the phase signal Q45 and the output of 
a logic gate ET45, which receives on its two inputs the phase 
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fr44 and the inverted output of a logic gate OU43. The output 
of the logic gate N0U41 is connected to the input R of the 
flip flop B42. The output (> of the flip flop B42 is connect- 
ed to the signal rqst_PR and the signal grnt_PR is delivered 
to the input R of the flip flop B43, to the validation inputs 
of the passage buffers BV40 , and to one of the inputs of the 
logic gate OU44. The logic gate OU44 receives on its other 
input the output Q of a flip flop B45 f and its output valid- 
ates the passage buffers BV41. The output of the logic gate 
ET45 is also connected to one of the inputs of the logic gate 
NOU40 which receives on another input the phase signal Q45 . 
the output of the logic gate 00 42 is connected to the inputs 
of the logic gates BT46 and ET47, which receive also on their 
inputs the output of a logic gate 0043 and respectively the 
phases $44 and ©45. 

The output "Q of the flip flop B44 delivers the signal 
r_RG via a buffer BV40, the output 5 of the flip flop B45 
delivers the signal wJRG via a buffer BV40 . The field cadre 
of the common bus BOSA. is connected to bus adr_RG via a 
buffer BV40. The bus data_RG is connecte4d to the outputs of 
the validation buffers BV41 and to the input of the register 
REG 40 , which receives on its input load40 the phase signal 
643. The output of the register REG 40 is connected, for the 
part tag, to the inputs of the buffers BV41 and to one of the 
inputs of a comparator COMP40. The comparator COMP40 
receives on its other input the part tag from the bus BOSA. 

The validation bit v, issued by the register REG 40 , 
connected to the comparator COMP40 has opposite the constant 
value 1. The bits v, a, m, f in the output , of the register 
REG 40 are connected respectively to one of the inputs of the 
multiplexers MOX40, M0X41, MUX42 , MUX43 . The outputs of 
these multiplexers furnish the state of these" same bits to 
the inputs of the buffers BV41. The multiplexer MUX40 
receives on its other inputs the constants zero and one, the 
inputs sel40 are connected to the output of a logic gate ET48 
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and to a signal dlle. The multiplexer MUX 41 receives on its 
other input the constant one, selected when its input. sel 41, 
receiving a signal dlle from the management processor PGP f is 
at the logic state of one. The multiplexer MUX 42 receives on 
its other inputs the constant 1 and the signal r_adrbloc_SR, 
on its inputs sel42 receiving the signals dlei and dlle com- 
ing from the management processor PGP. The multiplexer MUX 43 
receives on its other input the constant one, selected when 
its input sel43, connected to the output of a logic gate 
ET49, is at logic state one. 

The logic gate ET49 receives on its inputs the output 
eg 40 of the comparator COMP40, the signal f inverted and the 
signal m. The logic gate ET48 receives on its inputs the out- 
put eg40, the signal m inverted, the signal dlei and the out- 
put of the logic gate OU42. The logic gate OU43 receives on 
one of its inputs the signal eg 40, and on its other input the 
signal dlle. The output of the logic gate ET49 is also con- 
nected to the input load41 of the waiting line FIFO, already 
written in the series link management, processor PGS* The 
fields cadre and tag of the bus BUSA are connected to the 
inputs of the waiting line FIFO. The signals dlei, dlle and 
r_adrblock_SP come from the parallel link management 
processor PGP, which receives also the signal dei of the 
decoder DEC41. 

The operation of the assembly is as follows: the acti- 
vation signal valid initializes the phase distributor DPJB 
and validates the decoder DEC 41 which produces the activation 
of an output. as a function of the information type coding the 
nature of the demand in progress on the common bus BUSA . The 
active output may be: 

- dnp: request for reading of non-shared data. The 
signal done is deactivated upon appearance of the phase 841; 

- dmaj: writing request for operation of the block. 
The signal done is deactivated at phase 841; 

- dl: request for reading the block; 
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- de: request for writing the block; 

- dei: request for inforxaative writing. 

Theswe three cases necessitate an. access in reading to 
the directory RG, and the two latter an eventual re-writing 
of the directory. to this effect, a request for access is 
sent to the directory management processor PGR by the flip 
flop B42 (signal- rq!t_PR) the flip flop B43 blinding the 
phase distributor until access agreement f signified by the 
signal grnt_PR. The reading thus carried out of Q41 and Q44 
(flip flop B44) and the eventual writing of Q44 to Q45 (flip 
flop B45) with storage of the descriptor in the, register 
REG 40 from the phase 6 43. If the block is absent from the 
cache memory MC (eg40 = 0), then the signal done is deactivat- 
ed on the phase 844. If the block is present in the cache 
(eg40 = 1) , then: 

- if a ■ 1, a request to purge is activated (activa- 
tion logic gate ET49) on the condition that this block is not 
already in the line (f « 0) ; the only bit modified is f, set 
to one, at the time of re-writing of the descriptor, 

- if m * 0 r the block is invalidated by the multiplex- 
er MUX40 (activation of the logic gate ET4*8) , 

- if the request is local (dlle or diei active) then: 

1) in . case of reading or writing, the bits v and 
a are set to 1 and me is set to 0 (reading) or 1 (writing) 
(state of the signal r_adrbloc_SP) , 

in the case of informative writing, the bit m is 

forced to 1. 

In these : latter cases which require a re-writing in 
the management directory RC, the signal done is deactivated 
on the phase 945. 

The parallel link management processor PGP, an example 
of which is shown in FIGURE 16, is charged with requesting 
the common bus BUSA and providing the transaction requested , 
either by the request management processor PGO, or by the 
series link management processor PGS . 
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A request coming from the request management processor 
PGU can only be an informative writing request. a request 
coming- from the series link management processor PGS is 
either a request for reading or writing of the block, or a 
request for the current setting of the block. 

The parallel link management processor PGP comprises a 
flip flop B60, connected through its data input D to the 
signal rqstJJP. A flip flop B61 is connected through its 
data input D to the signal rqst_SP. the outputs Q and Q of a 
flip flop B62 are connected respectively to the clock inputs 
of flip flop B62s B60 and B61. theoutput Q of the .flip flop 
B62 is looped around to its data input. The outputs Q of the 
flip flops B60 and B61 are connected to the inputs of a logic 
gate 0060 . The output of the logic gate OU60 is connected, 
first, to a differentiator circuit D60, and in an inverse man- 
ner, to an input of a logic gate ET60, which receives on its 
other input the general clock signal h. The output of the 
logic gate ET60 is connected to the clock input of the flip 
flop B62. The output of the differentiator circuit D60 is 
connected to the input S of a flip f lop B6 3. The output Q of 
the flip flop B63 delivers a signal rq*stl to the destination 
of the arbitrator AB, and its input R is connected to the out- 
put of a logic gate ET62. The signal grlTtl coming from the 
arbitrator AB is connected to the logic gates 0062 and NOU60. 
the l ogic gate OU62 receives on its other input the signal 
valid inverted, and its output is connected to the input of a 
differentiator circuit D61. The output of this circuit D61 
is connected to one of the inputs of the logic gate ET62 and 
to the input 2 of a flip flop B64. The output Q of this flip 
flop B64 is connected to the validation input of the passage 
buffers BV60 and in an inverted maner, across an inverter 
with an °P en collector ISO', to the signal vaTIa", The signal' 
done is connected to the input of a differentiator D62 . the 
output of this circuit D62 is connected to the input R of the 
flip flop B64 and to one of the inputs of the logic gate 
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NOU60 . The output of this logic gate NOU60 is connected to 
one of the input logic gates NET60 and NET61, which receive 
on their other input, respectively the outputs Q of the flip 
flops B60 and B61. The outputs Q of the flip flops B60 and 
B61 are also connected respectively to the validation inputs 
of the passage buffers BV61 and BV62 . The output of the 
logic gate NET60 is connected to the input S of the flip flop 
B60 and to one of the inputs of a logic gate ET63, which 
receives on its other, input the output of a differentiator 
circuit D63. The output of the logic gate ET63 delivers the 
signal acq_UP to the management processor PGU. The output of 
the logic gate ET63 delivers the signal acg_UP to the manage- 
ment processor PGU • The output of the logic gate NET 61 is 
connected to the input sf of the flip flop B61 and furnishes 
the signal "aclc_SP, The bus adr_bloc_UP is connected to the 
input of the validation buffers BV61 and to one of the inputs 
of a comparator COMP60* 

The bus adr_bloc_SP is connected to the input of the 
validation buffers BV62. The outputs of the buf f ers .BV61 and 
BV62 are connected together and to the input of the 
validation buffer BV60 . The output of the buffer BV60 is 
connected to the common bus BUS A. The logic gates OU63 and 
OU64 receive on their inputs respectively the outputs Q of 
the flip flops B60 and B61, the logic signal grnti and the 
signal maj for OU64. The output of the logic gate OU63 
delivers the signal dlei, and the output of the logic gate 
OU64 the signal dlle. The other input of comparator COMP60 
receives the fields tag and cadre from the common .bus BUSA. 
The input en60 of the comparator COMP60 is connected to the 
output of the logic gate ET61, which receives on its inputs 
the signal grnti inverted, the- signal dei and the signal 
rqstJJP inverted. The output eg60 of the comparator COMP50 
is connected to the input of the differentiator circuit D63. 
The output of this circuit is also connected to the inputs R 
of the flip flops B60 and B61 and the other input of the 
logic gate ET62. 



The operation of the assembly is as follows: 
The flip flops B61 and B62 associated with the logic gates 
ET60 and ET61 constitute a local arbitrator. This arbitrator 
examines alternatively the requests rqst_UP and rqiHE^SP, and 
resends them to the arbitrator AB of the common bus BUS A by 
the signal rqstl . The access accord is furnished by the val- 
idation of the signal grnti and cycle bus takes place upon 
deactivation of the signal' valid which .frees the arbitrator 
AB. The activation of the signal done frees the local 
arbitrator: the transaction is terminated. 

If the request comes from the series link management 
processor PGS, then the signals die! and dlle indicate to the 
associated snooper processor PE the nature of the updating of 
the status bits of the block to be carried out in the direct- 
ory RG . 

if the request comes from the request management proc- 
essor PGU f the in case of detection of informative writing on 
the same block (signal dei coming from the snooper processor 
PE) , an. immediate liberation takes place: the request manage- 
ment processor PGU, after consultation of the directory (the 
block has been invalidated) will direct its request to the 
series link management processor PGS . 

The memory management processor PGM, one example of 
which is shown in FIGURE 17 , is charged with assuring the 
reading or writing of the block in the central memory RAM and 
to participate in maintaining the coherence of the informa- 
tion in the various cache memories MC of the multiprocessor 
system. 

To this e.nd, it comprises a differentiator circuit D80 
receiving on its input a signal valid and connected by its 
output to the. inputs S of the flip flops B80 and B81 as well 
as to the input clr_8Q of a shift register SR80. On the put- 
put Q of the flip flop B80, with an open collector, is deliv- 
ered the signal done. The output Q of the flip flop B81 is 
connected to the input serial_in80 of the register SR80. 
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This flip flop B81 is connected by its input R to the output 
Q81, inverted, of the register SR80 . The register SR80 
receives oti its input clk80 the general clock signal h and 
its validation input "enSTJ is always active. The flip flop 
B81 and the shift register SR80 constitute a phase distribut- 
or DP_M. The signal valid is also delivered on the valida- 
tion input enST of a decoder DEC 80 . This decoder is connect- 
ed by its data input to the part type of the common bus BOSA, 
and furnishes the signals dnp f dl, de, ei and maj. A read- 
write memory RAMFG of a size of two bits (termed respectively 
ro and rw) receives on its address bus the fields tag and 
cadre of the common bus BOS A. The data bus of this memory, 
constituted of bits ro and rw, is connected by one part to a 
logic PAL80, and to a logic gate ET80, directly for rw, and 
inverted for ro. The logic PAL80is connected to the field 
type and receives the logic signals cl, r/w, s/n, mff and 
en82: the signal cl is emitted from a flip flop B82 , the 
signals r/w and s/n from an associative waiting line AFIPO, 
the signal mff from a logic gate ET81, and the signal en82 
from a flip flop B83, which receives on its inputs S and R 
respectively the signals 082 and 081 inverted from the phase 
distributor DP__M. The logic PAL cables on its outputs ro-rw 
the following logic equations: dnp = 10; dl.mff ■ 10; dl.mff 
= 01; de = 01; maj.cl = 00; ei - 01; maj.cl.s/n.r7w = 10; 
maj.cl.s/n.r7w - 01. The output of the logic gate ET80 is 
connected to the input D of a flip flop B84 which receives on 
its clock input the phase 682. The output Q of this flip 
flop is connected to one of the inputs of the logic gate 
ET81, which receives on its other input the output of the 
logic gate 0080. The two inputs of this logic gate 0080 are 
connected to the outputs de and dl of the decoder DEC 80 . The 
reading input r of the memory RAMFG is connected to the phase 
881, and the writing input w to the output of a logic gate 
ET82. The logic gate ET82 receives on its inputs the signal 
.083 and the output of a logic gate ET83, the inputs of which 
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are connected to the signal s/n and the phase 9-87. The out- 
put dnp of the decoder DSC80 is connected to the logic gates 
ET84 and ET85, which receive on their other input respective- 
ly the phases 081 and 985. The signal s/n is also delivered 
to the logic gates ET86 and ET87 which receive on their other 
input respectively the phases 086 and 990. The output mff of 
the logic gate ET81 is also connected to a logic gate ET88, 
which receives on its other -input the phase 983/ and in an in- 
verted manner to the logic gates ET89 and ET90 which receive 
on their other input respectively the phases -983 and 987. 
The output of the logic gate ET88 is connected to the input 
wff of the line AFIFO. The outputs of the logic gates ET84, 
ET86 and ET89 are connected to the. inputs of a logic gate 
0U81, the output of which is connected to the input S of a 
flip flop B85. The outputs of the logic gates ET85, ET87 , 
ET90 are connected to the inputs of a logic gate OCJ82 of 
which the output is connected to the input R of the flip flop 
R85. The signal s/n is also, connected/ in an inverted man- 
ner, to a logic gate ET91 which receives on its other input 
the phase 089* The output of the logic gate ET91 is connect- 
ed to the input of a logic gate NOO80 which receives on its 
other input the output of the logic gate OU82. The output of 
the logic gate NOU80 is connected to the input R of the flip 
flop B80. The output maj of the decoder DEC80 is connected 
to the input of the logic gates ET92, ET93, ET94, ET95 , which 
receive on their other input respectively the phases 981, 
985, 985, 991. The outputs of the logic gates ET92 and ET93 
inverted are connected respectively to the inputs S and R of 
a flip flop B86, and those of logic 'gates ET94 and ET95 to 
the inputs S and R of a flip flop B82 . The output Q of the 
flip flop B82 produces the logic signal cl also delivered to 
the input cff of the line AFIFO and ,to the control input 
se!80 of a multiplexer MUX80 . The part tag, cadre of the 
common bus BOS A is connected to the data input of the line 
AEIFO and to one of the data inputs of the multiplexer MOX80 . 
The output dl of the decoder DEC80 is also connected to one 
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of the data inputs of the line AFIFO in order to produce the 
reading/writing signal l/ e . The data output of the lint 
AFIFO is connected to the other input of the multiplexer 
MUX 80 . The output of the multiplexer MUX80 is connected to 
the address bus of the central memory RAM for the part 
tag. cadre, and to the inputs of the decoders DEC81 and DEC82 
for the part champ cpu. The output Q of the flip flop B86 is 
connected to the writing input of the central memory RAM and 
to the input en8 4 of the decoder DEC82 . The output Q of the 
flip flop B85, slightly retarded, is connected to the reading 
input of the central memory RAM and to one of the inputs of a 
logic gate ET96, which receives on its other input the output 
of the logic gate OU82.. The output of hthe logic gate ET96 
is connected to the input en83 of the decoder DEC81. The out- 
put j of the decoder DEC81 is connected to the validation 
input of the passage buffers of the memory shift register 
RDMj and the output j of the decoder DEC82 to the loading 
input of said memory shift registerRDM.. . 

The operation of . this assembly is as follows: 
The activation of the signal valid provokes the un- 
latching of the phase distributor DP_H, and the validation of 
the decoder DEC80 , which comes to permit the determination of 
the nature of the request. The phase 081 is used for reading 
the state of the bits corresponding to the block requested in 
the memory RAMFG, and the combination ro.rw is stored in the 
flip flop B84. A first writing takes place in the memory 
RAMFG on the phase 083 which permits updating the status 
bits. Their value is furnished by the logic PAL80 and per- 
mits obtaining the following chains: 

- in case of request for a block of non-shared data 
(dnp) then regardless of the state of the bits ro.rw (rw is 
forced to zero), the state 10 is forced ("block broadcast in 
reading") ; 
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- in case of request Eor a block for reading (dl) or 
writing (de), if ro.rw = 01, then the request is placed in 
the waiting line on the phase 983 and the state 01 is forced 
(in fact this is the preceding state), if not the state 10 is 
forced in case of reading ("block broadcast in reading") and 
the state 01 is forced in case of writing ("block broadcast 
in writing") ; 

- in case of request for updating (maj), the state 00 
is forced ("block not broadcast"). In these various cases, a 
reading or a writing into the central memory RAM is carried 
out, toward or away from the memory shift register RDM., 
identified by the field cpu of the common bus BOS A. In the 
example chosen, the duration of the memory cycle RAM is 4 
periods of the general clock h. In the case of reading of 
non-shared data, the cycle is carried out from &81 to 885, in 
the other cases from 883 to 887. The writing is carried out 
from 981 to 885 ; 

- in case of informative writing, this does not cause 
movement of data, but forces the status bits to the value 01 
(state of departure is in this case forqedly 10)? 

- in case of request for updating, a consultation of 
the waiting line AFIFO is systematically carried out. This 
consultation may cause the reading of a block, in the case 
where a central unit CPU is waiting in the line AFIFO for 
operating this block. 

The reading is carried out from 086 to 090 and the 
state of the bits is forced to 10 (request tor a reading) or 
01 (request for writing). The end of all operation trans- 
lates by resetting to zero the flip flop B80 which activates 
the signal done. This deactivation may be produced on the 
phases 085, 987 or 991 according to the requested ^operation, 
or on 989 if the consultation of the line gives a negative 
result. 

The associative waiting line is not detailed. It con- 
stitutes in a conventional manner an associative store used 
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in the waiting line. the number of words o£ this memory is 
equal to the number of the central unit CPU of the multipro- 
cessor system. An internal "daisy chain" identifies on each 
phase 081 the next candidate word to a writing, which is 
produced on the phase 983 by the signal wff. The signal cff 
unlatches a comparison starting fromthe phase 685 , the flip 
flops of the response rriemo.ry having been reset to zero on the 
phase 684. The result of the comparison is propagated on the 
signal s/n (some/none) and the contents of the word concerned 
is available on the data output from the phase 986. This 
word is then invalid on the phase 690. 

In the architecture described above, the snooper 
processors PEj are urged to each address transfer on the 
common bus BCJSA, with eventual consultation of their cache 
memory MCj. This consultation is most of the time useless 
(low probability of the presence of the address of the block 
corresponding to the transfer, in the cache memories). 

It should be .noted that the memory management process- 
or PGM keeps the state ' bits of the blocks and makes possible 
a central management for maintaining coherence. To this 
effect, one can add to the architecture described above 
(FIGURE 10) a parallel synchronization bus operating accord- 
ing to the same algorithm as the synchronization bus SYNCHRO 
of the variation which is described hereafter. The snooper 
processors are no longer then properly spoken of a snoopers 
(since connected on the synchronization bus and not on the 
common bus BUSA) , and are designed as coherence maintenance 
processors (PMC., for the variation in FIGURE 18). Thus, 
the memory management processor PGM remains urged to each 
transfer on the common bus BUSA, but the coherence mainten- 
ance processors are urged by the processor PGM only when they 
are concerned by the transfer. 

FIGURE 18 presents a variation in which the coherence 
is maintained according to the principle discussed above. 
This variation takes the general architecture of FIGURE 6., 
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with the address of the blocks which transfer by the series 
links LSj. This system comprises a parallel synchro- 
nization bus SYNCHRO, of the same logic structure as the com- 
mon bus BUSA, but directed at the sole initiative of the 
memory management processor PGM. 

The structure of the central unit UC,. is conformed 
to that presented in FIGURE; 0L0 , with some modifications: 

- the structure of the cache memory MCj remains the 
same, as well as the structure of the management directory 
RG j f 

- the parallel management processor PGPj disappears, 
since the common bus BUSA no longer exists and the functions 
which' had been attributed to it are turned over to the series 
link management processor PGSj; 

- the snooper processor PE^ is replaced by a coher- 
ence maintainance processor PMCj which takes care of keep- 
ing the state bits of the blocks in the cache memory MCj in 
order to assure the coherence and which is active at the sole 
initiative of the memory management processor PGM j via the 
synchronization bus SYNCHRO; 

- the request management processor PGU no longer knows 
a single partenary: the series link management processor 
PGS^r on which it reports all its requests; 

the series link management processor PGS . is 
charged with the transfer of the addresses and the data, con- 
forming to the principle described for the system of FIGURE 
6, each address being prefixed by the nature op the request; 

- the functionalities of the memory management process-* 
or PGM are those described in reference to FIGURE 17, its act- 
ivation no longer being assurred by the signal valid, which 
disappears (since previously associated with the common bus 
BUSA) , but by the arbitrator ABM described in the system of 
FIGURE 6, which serializes the requests for service which 
move by the series links. The memory RAMFG is also constitut- 
ed by a supplmentary field cpu associated with the state bits 
ro.rw. 



71 



The general functionning of the embodiment shown in 
FIGURE 18 is as follows: 

Each request of the treatment processor CPU activates 
the request management processor PGUj with the reading or 
writing indication and code or data. This processor requires 
an access to the management directory RG j after the direct- 
ory management processor in comparison with the directory man- 
agement processor PGB j . The consultation of the directory 
leads to one of the following cases: 

- the block is present in the cache memory MCj, with 
an unmodified state (m = 0): if the request is for reading, 
the information requested is extracted from the cache memory 
MCj and furnished to the treatment processor CPUj«* If 
the request is for writing, then an informative writing 
request ei is transmitted to the series link management pro- 
cessor PGSj; 

- the block is present in the cache memory MCj, with 
a modified state (m - 1); the request,- reading or writing, is 
satisfied; 

- the block is absent from the cache memory MC^; a 
block request for reading "or writing is transmitted to the 
series link management processor PGSj . 

Thus, the requests made to the series link management 
processor may be: a request for reading non-shared data 
(code): dnp, a request for reading the block: dl, a request 
for reading of the block in order to carry out there a 
writing: de , a request for informative writing: ei . 

At these various states, it is necessary to add the 
state maj corresponding to the purging of a block, either at 
the request of the coherence maintenance processor PMCj, or 
for liberating a placement of a block in the cache memory. 
"The' addresses thus prefixed move by the series links LSj, 
and conformingly to the principle set out during the descrip- 
tion of the architecture of FIGURE 6, urge the arbitrator ABM 
when: 
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in case of reading a block, the address is 
transmitted, 

- in case of writing a block, the address and the data 
are transmitted. 

These requests are treated sequentially by the memory 
management processor PGM, of the same general structure as 
that described in FIGURE 17. Their treatment is as follows: 

1/ dnp: request for non-shared data. The block is 
transmitted and takes the state ro.rw = 10. 

2/ dl: request for reading of a block. 

If the block is in the non-broadcast state (ro.rw = 
00) or "broadcast in reading- (ro.rw - 10), it is transmitted 
and takes or keeps the state ro.rw » 01. 

If the block is in the state of broadcast in writing, 
the request is sent into the waiting line AFIFO. The memory 
management processor. PRM then finds in the field cpu of the 
memory RAMFG the address of the cache memory MCi which con- 
tains the current version of the requested block. A request 
for purging is then emitted on the synchronization bus 
SYNCHRO, to the sole destination of the coherence maintenance 
processor pmc^ associated' with the cache memory MC con- 
cerned. This request may be qualified by the addressed 
command . 

One wil note that the coherence maintenance processor 
PMC. has not consulted the associated management directory 
RG L since the memory management processor PGM knows of the 
fact that it is the only possessor of the current copy. Its 
role consists simply of removing the request and depositing 
it in the associated file FIFO^ . 

3/ de: request for reading of a block in order to 
make, a writing . 

-If the block is in the "non-broadcast" state (ro.rw = 
00), it is transmitted and takes the "broadcast in writing- 
state (ro.rw a* 01). 

If the block is in the "broadcast in reading" state 
(ro.rw - 10), then the memory management processor emits, a 
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universal block invalidation command, then transmits the bloc 
with the state "broadcast in writing" (ro.rw - 01>. the univ- 
ersal command provokes the activation of all of the coherence 
maintenance processors PMC^ which execute strictly the same 
operations as those described for the system of FIGURE 10. 

If the block is in the "broadcast in writing" state, 
the request is sent to the waiting line AFTFQ. As before, 
the memory management processor PGM sends a command addressed 
to the only possessor of the current copy . 

4/ maj: request for writing of a block after a purge. 
The operating algorithm is in this case strictly the 
same as that described in reference to FIGORE 17 for the PGM 
processor. 

It will be. noted that the problem of discharge of the 
writing naturally found its solution in this embodiment by an 
addressed command for discharge. 

5/ ei: informative writing. 

This case is treated directly on the commun bus BUSA 
in the architecture presented in FIGURE 10. In the 
embodiment provided here, and in order ot assure the 
synchronization, this operation is controlled by the memory 
management processor PGM. 

If the block is in the state "broadcast in reading", 
then a command, at the time universal and addressed, is sent 
out: addressed in the sense that the coherence maintenance 
processor PMCj concerned notes the discharge of the request 
for informative writing and passes the block concerned into 
the state "modified" in the management directory RGj, 
universal in the sense that all the other processors PMC^ 
must invalidate this block in their directory. 

The block . in the "broadcast in writing" state indi- 
cates that an informative writing request has been treated on 
this same block during the time waiting for treatment of the 
request. In this case, the informative writing request is 
transformed into a writing request de , and follows the same 
treatment as in the corresponding case of writing . 
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The parallel synchronization, bus SYNCHRO is charged 
with broadcasting the block addresses prefixed by a processor 
number and a type of request, to be about 30 to 40 bits 
according to the characteristics of the multiprocessor. 
These data are moreover transmitted in a unidirectional man- 
ner. Their transfer may advantageously be made there by a 
series link. Their transfer rate is less critical than for 
the block, and simplified solutions may therefor be envision- 
ed, for example by the bias of "TAXI" circuits made by the 
A.M*D . Company. 

FIGURE 19 shows a partial block diagram of .one archi- 
tecture according to the invention, in which several central 
units UC^... aire Connected in clusters and share the same 
series link LS fc . In this goal, a local arbitrator ABL^ 
associated with the cluster is charged with arbitrating the 
access conflicts by means of communication of block 
addresses, and sharing with the memory processor PGM, trans- 
ported for this purpose, a signal busy^ indicating perman- 
ently the free state or occupied by the series link LS^. 
Means for coding and decoding of one at the head of the 
identification of the processor concerned at the interior of 
a dustier are associated with emission logic and reception of 
blocks of data. 

In the case where the address communication means for 
blocks are constituted by the common bus BUSA, the operation 
is as follows: 

If the central unit CPCJ Jc+ j desires to make a trans- 
fer of a block in the direction of the central memory RAM to 
a cache memory MC k+ j (case dnp, dl, de) or carry out an 
informative writing ei , then it requires access to the common 
bus BUSA by the local arbitrator ABL^, which propagates the 
request, its turn coming, to the arbitrator AB . The accord 
for access to the common bus BUSA is re- sent to the central 
unit CPa Jc+ j and the transfer is carried out in the manner 
described in reference to . FIGURE 10 „ All blocks transmitted 
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in the direction of the central memory RAM to the cache mem- 
ory MC k+ . must then be identified, since the order of the 
requests 3 is not respected of the fact of the possibility of 
being waiting in the line AFIPO of the memory management pro- 
cessor PGM. If the central unit CPU k+j desires to make a 
transfer of a clock in the direction of the cache memory 
MC k+j toward the central memory RAM (case maj),. then it 
requests first of the local arbitrator ABL k access to the 
series link LS^. The local arbitrator ABL k and the mem- 
ory management processor PGM are both susceptible of authoriz- 
ing the series link LS k : the contention is avoided by sync- 
hronizing the modification, of the signal busy k with the 
signal valid (the memory management processor PGM only activ- 
ates or reactivates a transfer during a memory transaction) . 
The occupation accord of the series link LS k leads the cen- 
tral unit CPU k+;j to transfer its block of information to 
the memory shift register RDM k , then to request the local 
arbitrator ABI» k for access to the common bus BOSA in order 
to carry out there the operation request, which is carried 
out according to the algorithm described in reference to 
FIGURE 10. The writing of the operation may cause a libera- 
tion of the block in the waiting line AFIFO of the memory man- 
agement processor PGM and request an occupied shift register 
RDM.. In this case, the transfer requested is delayed and 
chained with the transfer in progress. 

In the case where the block address communication 
means is the series links themselves, the operation is then 
identical to the preceding case for the preemption of the ser- 
ies link, and identical for the general operating algorithm 
to. that shown with reference to FIGURE 17. 

For example,. a reading request of the block emitted by 
the central unit CPO k+; .necessitates first an access accord 
to the series link LS k , an accord given by the local arbit- 
rator ABL k in concert with the memory management processor 
PGM. The access accord causes the' transfer of the address of 
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the block requested on the series link LS , which is,immedi- 
ately liberated: it is available for any other transaction 
if necessary. A writing request of a block follows the same 
protocol for access to the series link LS^* 

In the architecture described with reference to 
FIGURES 1 to 19 , there exist as many memory shift registers 
RDMj as central units CPCJj: one series link LSj being 
provided in a static manner' to a couple (RDMj*, CPU j ) . 

If there must evidently be at least one series link 
LSj between a central unit and the central memory RAM, the 
number of shift registers RDM., may be less. In effect, if 
tacc is the time for access to the central memory RAM, and 
ttfr the time for transfer of ,a block, it is not possible to 
keep more than n = ttfr/tacc shift registers simultaneously 
occupied. For example*, for tacc « 100 ns and ttfr - 1200 ns, 
one obtains n » 12 . 

Tacc and ttfr are then the criteria characteristic of 
the performance of the multiprocessor system according to the 
invention and putting in place n memory shift registers 
RDMj is compatible with a greater number of series links 
LSj that the condition of interspersing a logic of the 
interconnection network type RI between registers and links, 
the allocation of a memory register RDMj to a series link 
LSj being achieved in a dynamic manner by the memory manage- 
ment . processor PGM. 

Further, the central memory RAM wiil generally consti- 
tute m memory banks RAM^, . . . RAMp, RAM m arranged in para- 
llel, each memory bank comprising n shift registers RDMj 
connected by an interconnection network RI p to the assembly 
of series links LSj. It is then possible, under reserve 
that the addresses of the blocks be uniformly distributed in 
the memory bank« RAMpf to obtain a theoretical performance 
of m x n shift registers simultaneously active. The uniform 
distribution of the addresses is assurred by conventional 
mechanisms for interlacing the addresses. 
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In FIGURE 20a, there is partially shown an architec- 
ture conforming to the invention, comprising m memory banks 
RAM with n shift registers RDM.* per- memory bank and q 

P ^ 

central units UCj. Each memory bank is of the random 
access type provided with an input/output of data of a size 
corresponding to a block of information bi , this input/output 
*being (as before for the memory RAM) connected by a parallel 
bus to the assembly of elementary registers RDM lp . .RDMj p . 

The interconnection network is of a known structure 

("cross-bar," "delta," "banyan,"...). One will note that a 
multi-stage network is well suited in the case where the time 

for establishing ' a path is negligible before its time of 

occupation (the time of transfer of a block) and that it only 

concerns one bit per link. 

The memory management processor PGM is adapted to be 

assured of the dynamic allocation of an output and an input 

of the network, that is putting in relation a memory shift 

register. RDMj and a series link LS^ . 

In the case where the block address communication. 

means constitutes the common bus BUSA, the operation is as 

follows: 

In the case of a request for reading of a block of the 
part of the central unit CPUj, the memory management 
processor PGM p concerned allocates a shift register RDM^ t 
controls the interconnection network RI as a consequence, and 
initializes the transfer. 

In the case of a request for writing of a block of the 
part of the central unit CPU^, a path must first be estab- 
lished. To this end, a first request for establishment of 
the path is sent on the common bus BOS A, followed by the writ- 
ing request effecting the transfer of the block from the 
cache memory MCj "to the shift register RDM^ . At the time 
of the first request, the memory management processor PGM is 
charged with allocating a path and controlling the intercon- 
nection network RI. 
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In the case where the block address communication 
means is the series links themselves, a path must be estab- 
lished for all preliminary transfer. This problem is identi- 
cal to the classical problem of sharing of an assembly of n 
resources by m users and may be controlled by classical solu- 
tions for arbitration of access conflicts (communication 
protocols, supplementary signals). 

In the example of the architecture shown in FIGURE 5, 
the shift registers RDMj and RDP j , their validation logic 
LVl and LV2 were provided with high speed, the assembly being 
synchronized by a clock of a frequency F at least equal to 
100 MHz. 

FIGURE 20b shows a variation of the architecture pro- 
posed in FIGURE 20a, a solution conforming to the invention 
in which each series link LS^, which connects the processor 
CPU. to all of the memory banks, is divided into m series 
links LSj p connecting by point to point the processor 
CPU • to each of the memory banks RAM p . 

This process presents the double advantage following: 

- each link being of the point to point type, may be 
better adapted from the electrical poi-nt of view or the fiber 
optic point of view, 

- a level of supplemental parallelism is obtained 
starting from when the treatment processor is in the state of 
anticipating block requests, which is the case actually for 
the for the highest performance processors. 

The interface logic (noted previously TFR j and 
RDP.) which is associated with the- series link LS.., side 
processor CPU.., is then duplicated into m copies I 1 ...I p _ 
...I . One will note the presence of a link for maintain- 
ing the coherence of information, proprietary to each memory 
bank RAM p . The operation of this link is analogous to 
that of the bus SYNCHRO of FIGURE 18. 

There is shown in FIGURES 21a and 21b another struc- 
ture of the memory RAM which comprises 2 U memory planes, 
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each memory plane having a front of t/2 u binary data (for 
reasons of clarity, in FIGURE 212a is shown the means necess- 
ary for the reading of a block bi , and in FIGURE 21b the 
means necessary for writing) . The shift registers RDM j or 
RDPj are constituted of 2 U elementary sub-shift registers 

RDM • with t/2 u bits of capacity. The example shown in 
DP - 

FIGURE 20 is a realization of 8 memory planes (u-3) . (For 
clarity of the drawing, there is shown a single shift regist- 
er RDM j formed for the assembly of sub-registers RDMj p ) . 
Each memory plane RAM comprises opposite its access front 

ST 

an assembly of elementary shift registers RDM. aild is cap- 

u 

able of operating with a shift frequency of at least F/2 . 

The operation of the assembly in case of reading is 
illustrated in FIGURE 21a. A block is read in a synchronous 
manner in the assembly of 2 U memory planes and loaded in 
the same manner in the elementary registers of the same rank. 
The series outputs of these registers are connected to the 
inputs of a multiplexer MDXR made of high speed technology 
(ASGA). A circuit of this type, perfectly adapted, is avail- 
able at "Gigabit Logic" under the name "LOGO40." , and is cap- 
able of delivering a logic signal at a frequency of 2.7 GHz. 
It also furnishes a clock frequency divided by eight, which 
constitutes the clock for shift of the elementary registers 

In the case of writing > a symmetrical operation, shown 
in FIGURE 21b, is obtained with a multiplexer circuit DMUXR 
of the same manufacturer (known as "L0G41")., with the same 
characteristics of performance. 

Thus, a transfer frequency of 500 MHz is obtained with 
8 elementary registers operating at a frequency of 500/8 = 
62.5 MHz, which makes then able to be accomplished with more 
conventional technology CMOS" for example) . 

The multiplexer and demultiplexer circuits reference 
hereabove are combinable into assemblies of 16, 32,... bits. 
Thus, by associating respectively 16, 32 memory planes 
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operating at 62.5 MHz, it is possible to obtain flow rates of 
1 and 2 GHz, or a level of performance 2 to 4 times greater. 

It should be noted that the logic TFR may- be provided 
on one of the elementary registers, and that the validation 
logic LV is integrated with the circuits "ASGA" (output with 
open collector). 

FIGURE • 22 shows the, general structure of a component 
of the "VLSI" integrated circuit type, called a multiport 
series memory and capable of equipping a multiprocessor sys- 
tem according to the invention. This component may be used 
in multiprocessor architecture described previously, either 
to provide the central memory RAM and the associated shift 
registers RDMj, or for proftding each cache memory MC^ 
and its shift register RDP j . To simplify the notations, in 
the description which follows, the symbols relative to the 
central memory RAM and the associated shift registers will be 
retained. 

The list of pins of this circuit with the correspond- 
ing signals is as follows: 

- adbloc n -adbloc , : m address bits of block bi, 

u m— l 

- admotQ-admot^^ : k bits of word addresses in 
the block, 

numregQ-numreg^^ : n bits of register 
addresses rd, 

- cl? : "chip select" : circuit selection signal, 

- wr : "write" : signal for writing, 

- rcT : "read" : signal for reading, 

bit/bloc : ^control signal for the multiport 

function, 

- normal/conf ig : signal for the operating mode, 
* - datap-data^^ : 1 bits of data, 

- h 2.-h n : n clock signals, 

- d^-d n : n data signals. 

The values m, n,. 1 are functions of the current state 
of technology. The actual values may be as follows: 
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- m ■ 16 or 2 ±0 blocks bi of 64 bits each (or 4 

Mbits), 

- n « 3 or 8 registers rd, 

- 1 - 8 or a parallel interface of the octet type, 

- k = 3 because of the fact of 8 octets per block. 
The component provided comprises about 50 pins. 

This series multiport memory circuit is composed of a 
random access read-write memory RAM, of a predetermined size 
t, capable of being controlled in writing on the fronts inde- 
pendent of the size t/4 (value chosen by way of example in 
FIGURE 22) and t/1 . The data lines of this memory RAM are 
connected to the inputs of a logic of the "barrel shifter" 
type B3, or of the multiplexing MT , depending on the version 
of the component, the multiplexing logic MT may be considered 
as offering a sub-assembly of possibilities of barrel shifter 
logic and therefor simpler to provide. The address signals 
and the control signals of this memory RAM, namely csi, wri, 
rdi, adbloci, are delivered from a control logic COM. This 
logic COM receives further the information signals* from the 
pins cs, wr, rcT, bit/bloc, normal/ conf ig , numreg and is con- 
nected by the control lines "format" to the barrel* shifter 
logic 3S, and to the output of a configuration register RCl, 
and to the input of a- selection logic LSR providing the sig- 
nals srd Q , . . . srd n-1 and src^, src 2' src 3 - The 
outputs, of the barrel shifter type of logic GS constitute an 
internal parallel communication bus BUSI, connected to an 
assembly of shift registers RDg, . RD^^, on their para- 
llel inputs, for one part, on their parallel inputs, and for 
the other part on their parallel outputs through the valida- 
tion buffers BV100 Q ,... BVlOO^^^ and to the parallel 
input of. the configuration registers RC^, RC 2 ... RC i . 

The 1 bits of low significance of the bus BUSI" are 
also received on the 1 pins- data Q ,... data^-^. Bach 
shift register RD i and the associated logic gates consti- 
tute a functional unit ELRD- , controlled by an assembly of 
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logic elements which constitute a forcing logic LF^ . Each 
functional unit ELRD^ comprises logic gates ETlOO i and 
ETlOl^ connected on one of their inputs to the output 
srd i of the selection logic LSR, and receiving on their 
other input respectively the signals rd £ and wr i - The 
output of the logic gate ET100 L is connected to the input 
loadlOC^ of the shift register RD i , as well as the input 
loadlOli and the input S respectively of a counter 
CPTlOOj^ and of a flip flop BlOC^ appurtaining to the forc- 
ing logic LF L . The output of the logic gate ETlOlj^ is 
connected to the control input of the validation buffers 
BV100-. The output di is connected to the output of a 
logic gate PL if which receives on its data input the series 
output of the shift register RD i and on its control input 
the output of a logic gate OUl00 jL . The signal sent from 
the pin h L is delivered to the input clklOCU of the regis- 
ter rd. as well as the input downl00 i of the counter 
CPTIOC^. The output zero 10 Q L of the counter CPTlOO i is 
connected to the input R of the flip flop BlOO^. 

The forcing logic LPj_ comprises in addition a multi- 
plexer MUXlOO^^ which receives on its data inputs the values 
t and t/4. The data output of the multiplexer MUXlOOj^ is 
conected to the data input of the counter CPTlOO^ and the 
selection command sellOOj^ of the multiplexer MUXIOC^ is 
connected to the output 1 of the register RC^. The output 
Q of the flip flop Bl00 i is connected to one of the inputs 
of a logic gate ETl02 if which- receives on its other input 
the signal issued from the pin i of a register RC 2 . The 
output of the logic gate ETl02 i is connected to one of the 
inputs of the logic gate OUIQO^ which receives on its 
other input the signal issued on the pin i of a register 
RC 3 - The loading inputs of the registers RC^, RC 2 , 
RC 3 receive respectively the. signals src^ src 2 , src 3 
issued from the selection logic LSR. 

This component presents a duality of f unctionning : if 
the signal- bit/bT3c is in the "bit" ' state, then the 
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functionhing of this component is that of a conventional semi- 
conductor memory: the signals adbloc associated with the sig- 
nals admot constitute the address bus in the word unit* (8 
bits in the example), the signals cs , rcT, wr have the usual 
direction attributed to these signals, and the data pins 
convey these data. 

In an internal manner, in reading, the block of 
information designated by adbloc is read in the memory RAM 
and presented at the input of the barrel shifter logic BS or 
of the multiplexing MT. The combination of signals admot and 
bit/bloc . permit the control logic COM to provide to the 
barrel shifter logic BS or the multiplexing MT the "format" 
signals. The word concerned is then right justified in the 
output of the barrel shifter logic or the multiplexing and 
presented thus on the data pins data. 

In an internal manner, in writing the word presented 
on the data lines data is aligned by the barrel shifter logic 
LS or the multiplexing MT by the same signals of the control 
format as in reading, with regard to its position in the 
block. The control logic COM emits then a partial writing 
signal wri on the only leg of the memory concerned, and to 
the address designated by the signals adbloc. 

If the signal bit/bloc is "in the "bloc" state, then 
the operation depends on the state of the normal/conf ig sig- 
nal. The config mode programs the configuration registers 
RC^, R C 2 , RC^ addressed by the signals numreg, and pro- 
grammed from the data lines data. The register RC^ permits 
modifying the size of the block r t and t/4 in the example, 
either 6 4 bits and 16 bits. In an internal manner, the func- 
tionning is similar to that described in the "bit" embodi- 
ment: t or t/4 bits are aligned on the internal bus BOSI (in 
reading) , or opposite the leg of the block concerned (in writ- 
ing) . The- sizes of the multiple blocks may be envisioned (t, 
t/2, t/4...) • 

The- register RC 3 permits selecting for. each register 
a permanent direction of operation: either in the input 
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(RC3^ 0) or in the output (RC3^ =1). This permanent dir- 
ection .permits adapting the component to series Links with 
permanent unidirectional connections. The register RC 2 . 
permits choosing for each register, under reserve that the 
corresponding bit. of RCj be at the logic state 0 f a mode of 
operation with alternate bidirectional links: on a reading 
memory RAM, the shift register RD- concerned passes to the 
output mode for the time of transmission of the block, then 
returns to the rest state in the input mode. In an internal 
manner f the flip flop BlOO^, which controls the logic gate 
PL^, is set to 1 on a loading signal of the register RDM^ 
and reset to zero upon issue of the transfer of t or t/4 
bits, through the counter CPTIOO^V initialized at t or t/4 
according to the state of the register RC^ and which 
receives on its counting input the clock pulses hi. In nor- 
mal operation ( normal/conf ig signal in the normal state) for 
a reading, the. block addressed by the pins adbloc is loaded 
in thie register RD i addressed by the numreg pins. If the 
block is partial (t/4) r then it is transmitted in a position 
of low significance on the internal bus BUSI by the barrel 
shifter type logic BS or the multiplexer MT. This block is 
then transmitted upon activation of the clock signal hi. 

In the normal operation for a writing, the contents of 
the register RD i addressed by the numreg pins is written in 
the block memory RAM with the address adbloc. If the block 
is partial, it is transmitted into a position of high signifi- 
cance on the internal bus BUSI , then aligned opposite the leg 
of the block concerned by the barrel shifter logic BS or the 
multiplexer MT, and finally a partial writing signal wri is 
sent out on the leg concerned. 

It should be noted that if a partial block is in 
service, * then the address of this partial block in the block 
is furnished by the address lines admot. 

This component is perfectly adapted to the diverse var- 
iations of the architecture described. Associated in 
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parallel, 8 f 16.*. circuits of this type permitting provision 
of the apparatus described in FIGURES 20a , 20b. If the 
memory RAM is of high speed technology, then this component 
may also be used at the level of the cache memory, while 
multiplexing, according to the device described in FIGURES 
20a, 20b, the internal registers of a same component. 
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